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ABSTRACT 


One application of the theory of formal 
Operations is that absence of formal-operational struc- 
tures in students entails tegtaonaeaeene on curriculum 
and instruction. The present study represents an 
empirical test of whether this idea of intellectual 
structural limitations applies to secondary school 
students (grades 7 through 12). Tests of volume 
conservation, controlling variables, and the combinatorial 
system were administered to a sample of grade 7 students 
and to a sample of grade 9 students, to test the 
hypothesis that a majority of secondary school students 
is concrete-operational. Tests were administered to 
groups of students and employed objective scoring criteria. 
LwOgOne-hOUmmeInstluctiOnad esequences, in controlling 
variables and in generating combinations, were administered 
to the grade 7 sample over a one-week period. The sample 
was retested two days after and one month after the 
instructional sequences to test the hypothesis that 
Substage III-B behavior on tests of controlling variables 
or generating combinations cannot be induced in a short 
pericd of time (seven days). Of the grade 9 sample, 80% 


were judged to display Substage III-A behavior and 50% 


iv 


were judged to display Substage III-B behavior. Of 

the grade 7 sample, 51% were judged to display Substage 
III-A behavior and 47% were judged to display Substage 
III-B behavior on the pretest, while 64% were judged 

to display Substage III-B behavior on the posttest. 

No significant change occurred from posttest to retention 
test. The increase from pretest to posttest in the 
proportion of the grade 7 subjects classified as Substage 
III-B is significant (x? = 3.71, df = 1, p< .05). The 
increase from pretest to retention test in the proportion 
of the grade 7 subjects classified as Substage III-B is 
Significant (x? BO. OGped eel) De< -5005)Thesresuits 
may be taken to indicate that the idea of intellectual 
structural limitations should not be applied to the 


Majority of secondary school students. 
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CHAPTER I 
INTRODUCTION 


The present study is an attempt to test the 
validity of a proposition currently being advanced by 
SOMemeaucalcOLGm(.. Q meballacmoayre, 19747) Renner et al, 
1973; Lawson, 1973; Lawson & Renner, 1974; Wood, 1974). 
The proposition is that many or perhaps most secondary 
school students are not "formal operational" and are 
not able to learn certain concepts, nor to understand 
puvedy sc Ormalsreasoning. |For, instance Renner et al 
GUo7Sjecialmuthave no less than //%,0f junior high 
students is concrete operational, and in order for 
these students to profit educationally and intellectually 
from school, they must (not should, must) be interacting 
with concrete objects, events, and situations." To 
quote further: 

This is particularly true at the tenth-grade 
level, where biology has traditionally been 
taught and most of that) population enrolis in it. 
Any teacher facing a biology class can be nearly 
certain that about 70 percent of those present 
are concrete operational thinkers. Now consider 
such concepts as the DNA molecule, gene mutations, 
and the chemical basis of life. From our per- 
spective, such topics and concrete operational 


learners are grossly incompatible [p. 347]. 


Ball and Sayre (1974) put it this way: "Quite 


because they have not developed "the mental framework 
necessary for abstract thinking." 

The idea that limitations in intellectual 
structures place limitations on what students may be 
able to learn or understand may be termed "the idea 
of intellectual structural limitations." This idea 
will be defined in more detail below, but the problem 
to which the present study is addressed is whether this 
idea applies to secondary school curriculum and instruc- 
tion. (Secondary school is taken here to mean grades 
/echnroughs 2: 

The idea of intellectual structural limitations 
has been questioned before. Sullivan (1967, 1969) has 
cautioned that "the use of Piaget's stages as indicators 
of ‘learning readiness' seems most premature and needs 
more careful consideration on both the research and 
theoretical levels [1967, p. 33]." Hamlyn (1971) says 
that to apply stage theory to education in this way 
may be "educationally disastrous," for there is the 
"necessity within education of trying to teach people 
what they cannot understand [p. 23]." Ausubel and 
Robinson (1969) see a danger that: 

of) dott Literal interpretation of the terms used to 
describe Piaget's stage of formal operations may 
be causing some educators to put an unnecessarily 


low ceiling on the complexity of intellectual 
tasks to which young children are subjected [p. 200]. 


While the above objections are based on conjec- 
ture, the present study represents an empirical test of 
the idea of intellectual structural limitations for 
the particular case of its applicability to secondary 
SCuoO CUE CULUMednGmg mSLLUCELON,. 

A case will be made that the applicability of 
the idea of intellectual structural limitations to the 
secondary school rests on the validity of three state- 
ments: 

(a) The onset of the stage of formal operations 
(Substage III-A) does not occur for a majority of 
Students until aie Grareernsiavay school years or later. 

(b) “the full realization of formal operations 
(Substage III-B) is not attained for a majority of 
students during the secondary school years. 

¢€) Progression from Substage III-A to Substage 
ITI-B is necessarily slow (of the order of years) and 
is not subject eee ere acceleration in the 
secondary school years. 

Ttewills then besvargquedethatstousseructunal 
limitations to be applicable to secondary school either 
(a) must be true, or else (b) and (c) must be true 
Simultaneously; and that to establish the empirical 
falsity of (a) and at the same time either (b) or (c) 


would be to destroy the connection between the theory 
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Cierorma | ene ceene and the idea of structural limita- 
tions. The present research aims to do this. 

The three statements will be cast into testable 
form by operationally defining the terms "attainment 
of Substage III-A," “attainment of Substage III-B," 
and “appreciable acceleration," so that ultimately the 
questions to which the present study seeks answers are 
threefold: 

1. What is the incidence of students in secondary 
schools who fail tests (to be specified) designed to 
detect (a) recognition of the need to control variables, 
and (b) the acquisition of volume conservation? 

2. What is the incidence of students in secondary 
school who fail tests (to be specified) requiring (a) 
experimental control of variables, and (b) generation of 
the set of all subsets from a set of four elements? 

3. In an experiment with age and training as 
independent variables and acquisition of Substage III-B 
as criterion, are the following effects significant? 

(a) training main effect; 


(b) age-training interaction effect. 


DER INEDPION OF TERMS 


The idea of intellectual structural limitations 

A prerequisite to definition of the idea of 
intellectual structural limitations is to establish 
the meaning of the term structure. In Piagetian theory, 
thinking is the act of performing mental operations. 
These operations or mental actions are not isolated 
entities but are organized and coordinated in systems 
or structures. Stages of intellectual development are 
characterized by their structures: eight structures 
called groupements Pimms Lge mOlmcCOnerete sOperatlons 
(Stage II in the theory); and two structures called 
the combinatorial system and the four-group in the stage 
OPertormaly operations. or Stage IIL (Piaget, 1950, p. 36: 
Poot app emce- 9); = )964D;) 1970; Prager & Inhelder, 1968, 
p. 329). In Piagetian theory, the stage of formal 
Operations is divided into Substages III-A and I1I-B, 
with Substage III-A earssacrirs Woakee the "genesis" of the 
structures and Substage III-B PieglP oteretactiee POnn OT 
“equilibrium: =with-ayperiod of the order of) three 
years normally intervening between the substages (Inhelder, 
LES A Opes WE SAAS he 

Themrdcasof jintellectual structural limitations then, 


LomOeCe INCOM EOECOnS) Stl Ol tWwOrnocLOns . wine rirst. notion is 


that if structures are absent in a given subject, then 
forms of reasoning which depend on these structures 

will have no significance for the subject. He will not 
use such forms of reasoning, nor understand them. The 
second notion is that certain concepts may be impossible 
for a subject to learn if he lacks the eer ears 
structures, because in Piagetian theory, subjects 
construct scientific concepts for themselves through 
interaction between their intellectual structures and 


empirical experience (Piaget, 1970). 


Substages JII-A and JII-—-B 


A given subject will be said to have attained 
Substage III-A if either 

(a) he may be classified as a conserver of 
volume, or 

(b) he gives evidence of seeing the need to 
control variables in an experiment involving two or 
more independent variables. 

A subject will be said to have attained Substage 
TilaBeriees chen 

(a) he demonstrates the ability to devise a 
controlled experiment involving two or more independent 
variables, or 

(b) he is able to generate all members of the 


set of all possible subsets of a set of four elements. 


By, - poe 
{iucasataun 


Te 


These definitions will be justified in Chapter III, 
where precise operational definition of attainment of 
Substages III-A and III-B will be given in terms of the 


tasks administered. 


Tireaining and acceleration 
The development by a subject of Stage III 


structures from genesis to complete form takes about 
three years "in a rich cultural environment [Inhelder, 
19707 p. 25]. For the present purposes then, 
acceleration may be defined in relation to development 
within Stage III as any reduction below three years in 
the average time between Substages III-A and III-B. 

The term appreciable acceleration is defined 
as the experimental induction in a group of Substage III-A 
subjects of Substage III-B performance on a given task in 
a period pemene order of weeks. This definition also will 
be justified below. 


Thesmajority of -aspopulation Sis “more’vthan 5037 


DEFINITION OF THE PROBLEM 


The central question is whether the idea of 
antellectual structural limitations is applicable to 
at least a majority of secondary students. If most 


secondary school students have not entered Substage III-A, 
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StMuctUutelmiimitatrolscmeOoulLa, apply inea straigitLorward 
manner. The structures are absent and any learning which 
may rely on the structures is theoretically not possible. 
On the other hand, if most secondary school students have 
attained Substage III-A but not Substage III-B, then the 
idea of intellectual structural limitations might still 
apply, but only if a given curriculum item were seen to 
require full realization of formal operational structures 
for proper comprehension. In that case, two courses of 
action would be possible. The item could be postponed 
untcplesubstage Lil-B, sorean acceleration to Substage III-B 
in the particular content area couid be undertaken (and this 
amounts to teaching the item). If acceleration over a 
matter of days were possible in a given area of curriculum 
content then there would be no necessary structural basis 
for postponing content. 

Consequently, the relevance of the idea of 
intellectual structural limitations to secondary school 
rests on the validity of three statements: 

(a) Substage III-A is not attained for a majority 
of students until the secondary school years or later. 

(b) Substage III-B is not attained for a majority 
of students during the secondary school years. 

(c) Progression from Substage III-A to Substage 


III-B in a given content area is necessarily slow (of the 


order of years) and is not subject to appreciable 
acceleration. 
The aim of the study is to establish the empirical 


falsity of (a) and at the same time either (b) or (c). 


PROCEDURE 
The study has two parts: a survey and a training 
experiment. In the survey, four tasks were administered 


to a sample of grade 7 students and a sample of grade 9 
students. Two of the tasks were designed to test for 
Substage III-A and two were designed to test for Substage 
III-B. In the experiment, two one-hour instructional 
sequences were administered to the grade 7 sample in 
three 40-minute sessions over a period of one week. The 
two Substage III-B tasks and one of the Substage III-A 
tasks were administered two days later, and again one 


month later. 


DELIMITATIONS 


The problem is delimited to the application of 
thesideasot estructural lini tatlons elo Une iedyatiiny of 
secondary students,Neither the idea itself as an 
application of the theory of formal operations nor the 


theory of formal operations as such is under test. On 
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the other hand, the validities of these two (the idea 
and the theory) are not necessarily assumed. 

Dhewvalidity Of application, of the idea of 
structural limitations to elementary pupils or to 
special groups of secondary students (such as low 
ability students) is not under test. | 

A further delimitation is that neither the 
specific nature of the instructional sequence nor the 
relative efficacy of different instructional sequences 
is a prime consideration. The prime consideration in 
relation to the instructional sequence is whether 
Substage III-B behavior can be induced in secondary 


school students in a short period of time. 


OUTEINE OF THE REPORT 


In Chapter II, the literature related to age 
norms for the stage of formal operations, and training 
experiments in formal operations is reviewed. In 
Chapter III, the tests are described, their validity 
discussed, and the method of phere administration 
described. The data are presented, analyzed and dis- 
cussed in Chapter IV. Chapter V contains a summary 


with conclusion and recommendations for further research. 


ao 
7 


ho at Bem (box 
+ pie 


CHAPTER 11 
REVIEW OF RELATED LITERATURE 


This chapter has two sections. The first section 
is a review of a number of reports of research which 
include the administration of formal-operational tasks. 

A very wide variation exists in the proportions of 

subjects classified by the various experimenters as 

being in the period of formal operations. In the review, 
this variation is seen to be largely the result of wide 
Variation in the criteria used by the various experimenters 
to classify subjects as formal operational. The second 
section of the chapter is a review of a number of experi- 
ments involving training in formal operations, concluding 
with a brief discussion of some theoretical problems 


associated with training experiments. 


AGE NORMS FOR FORMAL OPERATIONS 


The Genevan group consistently finds a majority 
of subjects in Substage Tl1-A by the age ,o£& 1b or 12, 
and in Substage III-B by the age of 14 or 15. These 
researchers do not een give detailed descriptions of 


the samples studied, but apparently the approximate ages 


for the substages given in The growth of logical thinking 


Veal 


12 


(Inhelder & Piaget, 1958) were based on "a somewhat 
privileged population [Piaget, 1972]." The method used 
in these experiments is for a trained experimenter to 
pose a problem to the subject and to observe the subject 
as he attempts to solve it. The experimenter takes note 
of all the subject says and does; not confining his 
activity to observing but actively questioning the 
subject. On the basis of this observation, the experi- 
menter makes a subjective judgment as to the stage level 
of the subjects. 

Other experimenters who have attempted to replicate 
or extend the Genevan experiments with other populations 
have not generally confirmed these age norms, and have 
found variation in age norms according to the particular 
task used. In reviewing these experiments it will be 
convenient to adopt a special designation for tasks 
based on those in The growth of logical thinking. These 
tasks will be represented by the letters GLT followed 
by the chapter number in which the task is described. 
For example, GLT-4 represents a task based on the 
pendulum problem in Chapter 4 of The growth of logical 
thinking. 

Higgins-Trenk and Gaites (1971) tested 162 sub- 


jects (ages 13 to 18).in a Wisconsin high school... They 


13 


Classified 43% as formal operational on a volume con- 
servation task, and 32% as formal operational on "open- 
ended situational dilemmas." Higgins-Trenk and Gaites 
concluded that "the normal adolescent is unlikely to reach 
the level of formal operational thought until his late 
teens or early twenties if he reaches it at all, and 
these results suggest he may well not." 

Renner and’ Stafford (1973) tested. 588. students 
in grades 7 through 12 in Oklahoma secondary schools. 
In grade 7, 13% were classified as "post-concrete 
operational," and a further 3% as formal operational. 
In grade 12, 15% were classified as "post~-concrete 
operational," and a further 18% as formal operational. 
Scores were allotted to subjects according to their 
Stage level on each task as follows: Substage II-A - 

1 point, Substage II-B - 2 points, Substage III-A - 
3 points, and Substage III-B - 4 points. Scores for 
individual tasks were added and subjects scoring 12 to 


14 points were classified as "post concrete," while 
subjects scoring 15 or 16 points (the possible) were 
classified as "formal operational." The tasks were 
administered by means of individual interviews, but 
only 20 minutes were given for the six tasks. Subjects 


must have had little time to think about their responses, 


and experimenters must have been forced to make snap 


14 


judgments. The criterion of perfect or near-perfect 
scores for classification as "formal operational" is 
a very rigorous one. 

Friot (1970) tested 210 grade 8 and 9 students 
in an Oklahoma junior high school at the beginning 
and end of a school year. Subjects were classified 
as being in the stage of concrete operations, ina 
"transitional stage," or in the stage of formal opera- 
tions. On the pretest, 18% were classified as being 
beyond concrete operational (that is either transitional 
OmecOrmal)ymwananOnm ence POSELeSst ,506.m Vablation ,accord— 
ing to which science course was studied was found in 
the posttest. Like Renner and Stafford, Friot used 
a battery of six tests: conservation of quantity, 
conservation of volume, GLT-1, GLT-2, GLT-3, and GLT-4. 
Points were allotted in the fashion of the Renner and 
Stafford study. Further inspection of Friot's results 
shows that 66% of the grade 8 subjects gave formal- 
operational responses to at least one of the tasks on 
the pretest. This is over three times the proportion 
classified by Friot as transitional or formal. 

Lawson (1973) classified 84% of 134 high school 
biology, chemistry, and physics students in Oklahoma 


as being beyond Stage II: 28% as "postconcrete," 38% 
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as Substage III-A, 14% as “transitional formal," and 
4% as Substage III-B. Lawson also used a battery of 


six tests and a points scoring system. 


Teena eieecteey (roy 2] tested=3s0sq1 51S ,. mean 
age 11.9, from a California school and 30 women, mean 
age 19.7, from Trenton State College, New Jersey. 
Thirty-three per cent of the girls were classified in 
Stage III: 29% in Substage III-A and 4% in Substage 
III-B; while 67% of the college women were classified 
in Stage III: 41% in Substage III-A and 26% in Substage 
BRT =n? The tasks used were GLT-3, -4, and -11l. Indivi- 
dual interviews were used and subjective judgments as 
to stage level were made with the aid of sets of 
"behaviors characteristic of each stage of development." 
Tomlinson-Keasey reported "ambiguities and difficulties 
involved in judging each individual's protocol.“ A 
training session followed the three tasks, and the 
results of the posttests are reviewed in the second 
section of this chapter. 

Wheeler and Kass (1974) tested high school students 
in Alberta. They classified 72% ahi 99 grade 12 chemistry 
Stucentsmiieo acco lil. sOlepinecubstage. ll l-A ands in 


Substage III-B. Two versions of GLT-7 were administered 
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to groups of subjects in written form. 

Jackson (1965) tested normal and subnormal 
children in England using five tasks: GLT-2, -4, -8, 
pote erand «= 2ee abonty—e1 ght) normalisubyectsiiwere 
tested in each of the age groups, 11, 13, and 15 years 
Old. Of the ll-year-old subjects, 6% were classified 
as Substage III-A (none as Substage III-B). Of the 
13-year-old subjects, 42% were classified as Stage III: 
38% as Substage III-A and 4% as Substage III-B. Of 


the 15-year-old subjects, 48% were classified as 


\ 


Stage III: 38% as Substage III-A and 10% as Substage 
III-B. Variation was found according to task administered, 
the proportions of subjects classified as Stage III ranging 
£romy6% on GLI-20to 383%) ons GLT—4< 
StCepuensected leo 1) administercdwagbatteryaeor 
29 Piagetian tasks to 75 normals and 75 retardates as part 
of a longtitudinal study. The mental ages at which 66% 
of subjects succeeded on the tasks were determined. Five 
different formal abilities (related to volume conservation) 
were attained by normals (IQ 90-110) according to the 
66%ecriterionLatamental agesmi,glsyeloviigg.and 24° 
Bredderman (1973b) tested 27 grade 5 and grade 6 
American students at three one-month intervals ina 
training experiment. thinethesfinstytest, sthemproportion 
ofsstudents-in stages! II, III-A, and III=-Bi were’ respectively 


41%, 55%, and 4%; and in the third test, 8%, 44%, and 483%. 


aay 


Bredderman used a test of controlling variables (GLT-3) 
in which the experimenter set the problem to a single 
subject by means of a set of prepared questions. 
Bredderman's experiment will be considered again below, 
under the heading of training experiments. 

One indicator of formal thought which has been 
used by a number of experimenters is volume conservation, 
which is supposed to be an early acquisition in Stage 
Pileelnhetcen ss Piaget, 19568, )p. So)mmestudiestinvolving 
volume conservation have been reviewed elsewhere (Hobbs, 
1972). Reported proportions of ll-year-old subjects 
classified as volume conservers vary from 41% (Lovell 
& Ogilvie, 1961) to 80% (Inhelder & Piaget, 1958). 

In a review of literature related to the period 
of formal operations, Farrell (1969) concluded that while 
experiments at Geneva had shown that formal operations 
"begin to emerge" at the "age of about 11," other research 
indicates "some flexibility as to the age at which formal 
operations begin to develop and that at which the structure 
is in equilibrium." Farrell noted the’ "possible effect" 
of cultural differences (including schooling) on the age 
norms. 

In a more recent review, Howe (1974) concluded 
that the results of the experiments by Renner and Stafford 


(1972), Stephens et al (1971), Lovell (1961; 1968), 
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e « - and other studies make it clear that only 

a minority of adolescents seem capable of, or 

find it necessary to, use this mode of thought 

[formal operations]. A few are able to solve 

vtcago=iiiie A problems fas fearliyiras*1 tor 2 /Wbut 

most cannot solve truly formal operational problems 
PRU Jeeta Gerri neethicestcenseom later eh f.tatealMs, 


Howe's conclusions may be a little premature in 
view of the variation in percentages of 13-year-old 
subjects (grade 7 level) classified as Stage III by 


various experimenters: 


16% --- Rennemiis Stat ford (1973) 
36% --- Bikands (2 6c) 

42% --— Jackson (1965) 

603% --- Halen ( Gomis) 

66% -—- Stepirenss etajadee(al 95/51 ) 
7153 --- Somerville (1974) 


Any comments on the wide disparity in these 
results would be superfluous. A weak explanation of the 
variation is that the proportions are of different popula- 
tions in different cultural milieux (all populations are 
English speaking, middle-class, normal children). The 
disparity is more likely to be due to the variation in 
criteria for stage classification used by the various 
expenimentérsm 'Partof tthe: variation an) ‘criteriallisivtin 
the choice of task. Piagetian theory gives no basis for 
comparison of results obtained on different tasks which 


are supposed to test for the same structure (Piaget, 1971). 
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BROcherepaLteOLrstnesvalLralLronmin Criteria 1S in deciding 
just what behavior is to constitute guidelines for 
experimenters' judgment on a given subject. An indica- 
tion of the complexity of this problem is provided by 
Somerville (1974). In Somerville's study a sample of 
236 Australian children from 10 to 14 years old was 
given the pendulum problem (GLT-4). The experimenter's 
aim was to develop a "detailed scoring system for per- 
formance." Children were tested individually, and 
"presentation of the task and the method of questioning 
were based as closely as possible on the work of the 
Original authors [Inhelder & Piaget]." Subjects were 
rated on a three point scale on each of 12 aspects of 
their performance. In accordance with the "patterns 

of performance ratings" found, nine stages of development 
were defined, two corresponding to Substage III-B, three 
corresponding to Substage III-A, and four corresponding 
to Stage II. Of the 13-year-old subjects, 75% were 
Classified as ,StageslIi including 332fase Substage 1 Ili-A 
and 42% as Substage III-A. Somerville's complex scoring 
system is indicative of the wide variety of behavior 
patterns which may be found at a given stage level on 


a given task. 
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TRAINING STUDIES 


Four training experiments in formal operations 
Nave been reviewed by Howe 41974): Fishbein et al (1970b) 
successfully trained most of 60 subjects, some as young as 
10 years, in the combinatorial system. They claimed 
SSeLUCtCULalmwacquis) tion ws byethneir Sings) Chatelsy 
learned ability which generalized to a different problem. 
Tomlinson-Keasey (1972) trained 72 subjects in GLT-3, -4, 
and -ll in a 45-minute session. Gains were found in 
Stage level on an immediate posttest consisting of the 
same tasks, but these gains did not persist or transfer 
according to a test one week later using an altered 
version of GLT-3 and versions of GLT-7 and -12. Brainerd 
and Allen (1971) successfully trained "density conserva- 
tion" to 52 grade 5 students in a single session. Trans- 
fer to ore GOnSsenVat 1OnmWasmLOUnG -mmpoclG 1 Cr ictal 173) 
taught 10- and ll-year-old subjects (N = 24) to solve 
the pendulum problem (GLT-4). After a single training 
session "the 10- and ll-year-olds were able to exercise 
formal operations logic and to produce solutions closely 
resembling those cited by Inhelder and Piaget as exemplify- 
ing the highest stage of reasoning." 

In a review of training in both concrete and 


formal operations Beilin (1971) expressed surprise at "how 
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little research is addressed to" the training of formal 
operations. He found the experiments reviewed to be 
unsuccessful in inducing skills related to combinatorial 
Chinki nga Entice Gmalleamis Co mAnderson,ecl9 65) wor uncon= 
vincing because of absence of a test of transfer (Fishbein 
Cuma Deel 970 2')9: 

Successful training experiments have been reported 
by Case and Fry (1973) and Bredderman (1973b). Case and 
Fry trained a "group of 15 low SES 14-year-olds who had 
not yet reached Piaget's stage of 'formal operations'" in 
designing controlled experiments and criticizing poorly 
controlled experiments. After 12 weeks the experimental 
SuCupaperbormedsisagni ficantly bettera(t j=58.4.4 df. =..28, 

Pryce -00l) mchangagmatbched ycontrol-group jon.a test related 
to controlling variables. Since controlling variables 

is a formal-operational scheme, this may be considered 

to be a successful training experiment in formal operations. 
Bredderman trained 27 grade 5 and grade 6 students in 
controlling variables. Four training sessions were used, 
spaced over one month. The design included a pretest, 

a posttest, and a retention test one month after the 
posttest. Bredderman concluded that in the posttest 
"nearly half of the students having mean age 11.8 years 
could be classified in late formal operations with regard 
io Televehibe Glashiliigey, (sep telerekeraiell tg-paskslonl <\ciry 


The overall picture seems to be that acceleration 
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of formal-operational behavior may be not only possible, 
but also relatively easy, although the evidence remains 
quitestirins 

A methodological problem related to the training 
of operational structures has been raised by Inhelder 
and her co-workers (Inhelder Steal el O07 wi nielieres 
pariclaiie,w L969 aemlneyeclaimethateattemptseto train 
subjects in conservation tasks may succeed in inducing 
subjects to respond correctly without their acquiring 
"truly logical structures [Inhelder & Sinclair, 1969, 
perlog ls" Stheye®recommend@®rigorous icriteriavforracquzrsition 


of logical structures to eliminate responses which "are 
examples of either a specific strategy learned in one 
type of problem or of a limited operational structure, 
Dutenoveotechewacgquisitaonsof) antrueuoperatronalustructure.”" 
The specific criteria they recommend are summarized as 
follows by Kuhn (1974): "To be considered as having 
genuinely attained conservation, a trained subject must 
make responses that (a) include appropriate explanations, 
(b) persist over time, and (c) generalize to nontrained 
material." 

With regard to the first criterion (requirement 
of verbal explanations), Brainerd (1973) has shown that 
such a requirement makes tests subject to systematic 


TVpeetieerror. He argues that in Plagetian theory, cognitive 
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structures do not depend on verbal behavior, therefore 
verbal explanations constitute sufficient but not 
necessary conditions for the presence of cognitive 
structures. Although Kuhn questions Brainerd's argu- 
ment, she shows that in posttests of training experiments, 
explanations cannot be considered to be even sufficient 
conditions of structure. Verbal explanations are not 
required in the tasks used in the present study. 

The second and third of the three criteria for 
presence of structure (retention and transfer) are 
employed in the present study. There are theoretical 
difficulties associated with both of these. One 
| PAPE: relates to the extent of retention or trans- 
fer which may be acceptable as sufficient evidence of 
"genuine" structural acquisition. Kuhn (1974) has shown 
thesambaguity ain the retention criterion: ) that structural 
aCcuUIsSi~I1on 42S NOt a necessary condition for retention: 
The transfer criterion would be more convincing if there 
were never or seldom any variation in subjects' behavior 
on different tasks supposedly related to the same structure. 
In any case, in the experiment described in Chapter III, 
subjects were not taught how to do the specific tasks 
used in the posttest, so that transfer was required, and 
a retention test was administered one month after the 
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CUBE Ta hael LL 
PROCEDURE 


In Chapter I, a casé was made that the applica- 
tion of the theory of intellectual structural limitations 
rescs Onjthe validityscoe, che following ee statements: 

(AjeoOUDStagemilila -AmisenoOtwattainedsrOreamma jOnicy 
of students until the secondary school years or later. 

(b) Substage III-B is not attained for a majority 
of students during the secondary school years. 

(c) Progression from Substage III-A to Substage 
III-B in a given content area is necessarily slow (of 
the order of years) and is not subject to appreciable 
acceleration. 

For structural limitations to be applicable to 
secondary Ponon ite either (a) must be true, or (b) and 
(c) must be true simultaneously. The procedure which 
follows is designed to establish the empirical falsity of 
(a) and at the same time either (b) or (c). 

Consequently, the study has two aims: to determine 
the proportions of secondary school students who may be 
Classified as Substage III-A or Substage III-B; and to 
test whether appreciable acceleration can be induced 


in the progression from Substage III-A to Substage III-B. 
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Consequently, thes studyshas two parts: | a survey and 
an experiment. Both involve the testing of subjects 
in order to classify them according to substage. The 
tests used for this purpose, together with the criteria 
used for classification, represent operational definition 
of the terms “attainment of Substage tit Awe onda tta tn — 
ment of Substage III-B." 

The tests will now be described and the validity 
of the operational definitions they represent will be 
discussed. The procedures used in the survey and experi- 


ment will then be described. 


TESTS 


Tests for formal operational structures usually 
follow procedures described in The growth of logical 
BO le cme e |dermomr Laget allo lO) Peel Cc HneScCaDLOCeOUGCS, 
a Single task is used to classify subjects somewhere on 
the full sequence of development. For instance a given 
subject might be classified as Substage III-B, Substage 
III-A, or Substage II on the basis of his behavior on 
the one task. Separate tasks are not used for each 
substage. The experimenter's subjective judgment usually 
enters into this multiple classification process. 

In the present study, an attempt is made to 


provide a basis for classification of subjects which 
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requires minimum judgment by the experimenter once the 
criterion has been decided for a particular substage. 
In order to do this, each task was designed to test for 
one substage only. Two tasks were used to test for 
Substage III-A and two others to test for Substage III-B. 

The wide disparity in the reported percentages 
of Stage III subjects at age 13 years given in Chapter if 
was seen to be probably a function of differing criteria 
for, classification... The criterion problem--déeciding 
what will be the criteria for classification of subjects 
according to stage level--is endemic to research related 
_to stage or structure theories, because stage level or 
structure aan never be determined directly but always 
has to be inferred from observed behavior on tasks. 
The problem of establishing criteria for a stage level 
or structure has two aspects: deciding on the form of 
the task, and deciding on what behavior elicited by the 
task will constitute evidence for the inferred stage level 
or-structure. 

These two problems will now be discussed in 
general terms, starting with)the Watternjproblem sy 7 The 


specific tasks used in the study will then be described. 
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Inference criteria 

For classification into Substage III-B, most 
researchers seem to require a subject to succeed com- 
pletely on a task, or even on each of a number of tasks. 
To do this is to assume that what a given subject does 
on every occasion is what he is able to do. Nonetheless, 
this requirement provides fairly objective criteria. 

The problem of inference criteria for Substage III-A is 
more difficult, because the Substage III-A subject is 
supposed to give evidence of formal operational thinking 
while being unsuccessful on the task or tasks. This is 
where subjective decisions enter--if a subject is unsuc- 
cessful on a task, is the structure absent, or is it 
present but not yet-stable or fully developed? 

For the present purposes the problem has to be 
Coneierea in relation to the idea of intellectual 
structural limitations, where the inference of lack of 
structures is taken to imply restrictions on curriculum 
content and teaching method. In the application of this 
idea, Type II errors may be seen to be more costly than 
Type I errors. Sweeping changes to secondary school 
curricula are being proposed on the basis of an inference 
that most students lack formal operational structures. 
For example, according to Lawson and Renner, if such an 


interence as correct, then “the curriculum will have to 
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be reshaped to reflect concrete objects, events, and 
Ssituations--rather than abstract mental images, rules, 
and roles--in the teaching methods and materials used 
[1974, p. 558]." To implement major changes in curriculum 
and method when the inference of absence of structure may 
be incorrect would be imprudent. Therefore for the 
purposes of testing the idea of intellectual structural 
limitations, Type II errors should be avoided in 
classifying subjects as Substage III-A or Substage III-B. 
In other words, the conservative course of action in 

this case is to adopt rigorous criteria of failure. 

In the present study four tests were used: two 
tests to classify subjects according to each of Substages 
LII-A and III-B. In order to classify a subject into 
a substage on the basis of two tests, two possible 
criteria could be used: success on both, or ADE on 
either. | 

Suppose that a task T is used to determine whether 
a ven subject ehas ta, structuresS .sSuppose,y funther that 
a certain type of behavior B on T is considered to be 
evidence for S. If the subject actually has S but the 
experimenter judges that he does not display B on T, then 
a Type II error will be made by the experimenter. If 
on the other hand the subject is judged to display B 


but does not have S, then a Type I error will be made. 
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Such error may be random or systematic. In order to 
classify a subject into a substage on the basis of two 
tasks, two possible criteria could be used: success on 
either, or success on both. The following argument shows 
that the criterion success on either is subject to more 
Type I random error but less Type II random error than 
the criterion success on both. A further argument will 
be advanced to show that since systematic Type II error 
can never be removed in principle, the criterion success 
on either is preferable on theoretical grounds. 

First consider random error. Suppose there 


are probabilities P_ and pe thateryper lL 4random)) errors 


if 


are made on the first and second tasks respectively. 


Suppose there are probabilities Por and Dae that Type II 


(random) errors are made on the first and second tasks 


respectively. PL will be the expected fraction of the 


subjects who should have passed the first task but failed, 


and PL will be the expected fraction of the subjects who 


Ji 
should have failed the first task but passed. If success 


on both tasks is adopted as the criterion, the overall 


probability of Type I errors is P ae while the overall 
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probabititys Otel y pen linecrLOrscers Porter Por Por ie 
success on either task is adopted as the criterion, the 
1 ' 
overall probability of Type I errors is PLtPy-Pr Py, 


while the overall probability of Type II errors is 
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ae NY Therefore if Type II errors are more costly, 


' 
Tita La: 

the conservative course is to adopt success on either 

task as the criterion, as long as precautions have been 

taken to reduce systematic Type I error in each of the 

tests individually. 

In high jump and similar athletic contests, the 
criterion success on any one of three tries is universally 
adopted, since the question "Can the jumper clear two 
metres?" is answered affirmatively if he jumps two 
Metres once. The criterion in any one task of those 
employed here is analogous to the crossbar on the 
high jump stands. 

Now consider systematic error. Although a 
structure S may be a ate ee condition for a subject 
to display behavior B on task T, S may not be a sufficient 
condition for B. Subjects may fail on T not because §S is 
absent, but because of such things as misleading or 
ambiguous instructions, failure to comprehend what is 
required, lack of motivation. Therefore while presence 
of B can be taken to imply presence of S, absence of B 
can never be taken to imply absence of S. 

As Braine (1968) pointed out, "It is clear that 
if one seeks to state an age at which a particuiar type 
of response develops, the only age which is not completely 


arbitrary is the earliest age at which this type of 
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response can be elicited using the simplest experimental 
Brecedures@ipe LS/ il em inethe presentacase., ithesaimes 
to determine whether a given subject is capable of a 
particular type of response. The only criterion which 
is not completely arbitrary in this case is whether this 
type of response can be elicited using the simplest 
experimental procedures. Operationally, the "simplest 
experimental procedures" become that task among the ones 
employed which is simplest for the given subject. 

In the present study, only subjects who failed 
both of the pair of tests for a given substage were 


classified as being below that substage. 


Task selection 

That part of the criterion problem related to 
design or selection of tasks amounts to the problem of 
task validity. The validity of employing volume conserva- 
tion, controlling variables, and generating combinations 
as indicators of Substages III-A or III-B will now be 
discussed, starting with volume conservation. 

According to Piaget, volume conservation "is not 
worked out conceptually before the beginnings of the 
formal devel" because- iwithout a doubt =| 3) © the conserva-— 
tion of volume throughout changes of form presupposes the 
abilyvty, to handle proportions, [Inhelder & Piaget, 1958, 
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on the formal operational four-group structure (Inhelder 
& Piaget, 1958, pp. 176-181). An advantage of volume 
conservation is that a fairly straightforward and 
objective test may be used to detect the acquisition. 
Controlling variables seems to provide both a 
sufficient indication of the beginnings of Substage 
ITI-A and the attainment of Substage III-B. In examining 
the mechanism by which subjects move from concrete 
Operations to formal operations, Piaget has pointed to 
realization of the need to control variables as a turning 
Portetinhelder & Piaget, 1958 "pp. 282-283)". This 
realization seems to be characteristically present in 
formal operational subjects, and characteristically absent 
in concrete operational subjects. Apparently, when two 
independent variables are involved in a situation, not 
only do concrete operational subjects fail to control 
one of the independent variables in order to test the 
effect of the other on the dependent variable, "but the 
idea of doing so does not even occur to them [p. 285]." 
Moreover, Piaget claims that the use of the method of 
proof based on controlling variables is by itself alone 
a sure indicator of the combinatorial system, one of the 
characteristic structures on Stage III (Inhelder & 
Piaget, 71958) pv.244),.eThis implies that recognition 
of the need to control variables is a sufficient condition 


for Substage III-A. 
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Furthermore, the scheme of "all other things 
being equal" is supposed to manifest itself differently 
in Substage III-A and Substage III-B. "As a general 
rule and in its authentic form, the schema ‘all other 
things being equal' appears only at Substage III-B 
[Inhelder & Piaget, 1958, p. 43]." At Substage III-A 
this type of proof “is still only partially, understood 
[p. 12]." For these reasons, manifestation of the idea 
of controlling variables seems to be an appropriate 
criterion for the genesis of formal operations, and 
ability to use the scheme an appropriate criterion for 
full realization of formal operations. 

Another test for Substage III-B is provided by 
thescombanatorial system. This structure enables a 
subject to generate systematically all of the possible 
combinations of a set of objects (the set of all possible 
subsets). The use of the combinatorial system as a test 
for Substage III-A is somewhat difficult, because at 
this level the subject is supposed to proceed in some 
systematic manner to generate combinations, but without 
success. There is some subjective difficulty in deciding 
whether a subject is proceeding systematically or is 
randomly generating the combinations. On the other hand, 
the criterion for Substage III-B may be straightforward 
and objective. The subject may be given a set of 


elements and asked to combine them in all possible ways. 
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If the subject succeeds in generating all of the possible 
combinations given sufficient time, then that subject 
may be classified as Substage III-B. 

Hence two tests of controlling variables, a 
volume conservation task, and a combinatorial system task 
were used in the study. The actual forms of these tests 
will now be described. In subsequent sections, the 
procedures for administration of the tests, the sampling 
procedure, and the instructional sequences will be 


described. 


VOLUME CONSERVATION TEST 


The volume conservation test to be denoted VC, 
is a group administered version of a procedure previously 
designed by the experimenter to minimize systematic ~ 
Type I and Type II errors. An extended rationale for 
the form of the test is given elsewhere (Hobbs, 1975). In 
order to be classified as Substage III-A on this task 
subjects Nave to~predict correctly that when a ball o£ 
clay is transformed in shape (rolled into a "sausage," 
broken into pieces, Ore ft VateenedainctO dea pancakes )s,ana 
then immersed in a glass of water, the water level will be 
thessSame as tOL an Initially, 1dentical untransformed ball 
placed in an identical glass of water. In order to minimize 


TVpemleerrorn due to subjects correctly predicting the water 


Sis, 


level on the basis of conservation of weight, the testing 
procedure employs a training sequence designed to illustrate 
to subjects that when a body is immersed in water, the 
Occupied volume rather than the weight determines the 
displacement of the water level, and a test to determine 
which subjects do not understand this. The training 
procedure also teaches subjects how to respond on the 
answer sheet. 

In order to minimize Type II error due to the 
use of relational terms such as "more" "same" or "less" 
the question subjects have to answer is "Where will the 
water come to?" In addition, subjects respond nonverbally 
by making a line on a diagrammatic representation of the 
glass to indicate their predicted level. Instructions 
for the tester and the response sheet are reproduced 


in Appendix A (pp. 113-125) 


CONTROLLING VARIABLES TESTS 


Two tests of controlling variables were used. 
The first, to be denoted "CV-A," was used to classify 
subjects into Substage III-A. The second, to be denoted 
"CV-B," was used to classify subjects into Substage III-B. 
CV-A was designed to determine whether a given 
subject recognizes the need to control variables. Sub- 


jects were shown two metal rods: a brass rod 51.4 cm 
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long, 5 mm in diameter; and an aluminum rod 45.5 cm long, 
12 mm in diameter. Each subject was supplied with a 
printed sheet describing an experiment in which a 

student attempts to bend the two rods. The test form 

is reproduced in Appendix A (os 121). By means of a 
diagram subjects were shown that the brass rod bends far 
more than the aluminum rod. Subjects were asked whether 
this proves that rods made of brass bend more easily than 
rods made of aluminum, and why or why not. A further 
question required subjects to suggest two ways to improve 
the experiment. In order to be classified as Substage 
III-A, a subject had to give evidence of seeing the need 
to control at least one of length, diameter, or bending 
force applied. 

CV-B was designed to test whether a subject could 
devs can puOceducemtOmtestathegeliceteot agsinglejvariable 
while holding a number of other variables constant. In 
the Inhelder-Piaget (1958, Ch. 3) experiments, subjects 
were presented with a series of metal rods of varying 
diameter, cross-sectional shape, and material. The rods 
were clamped at one end so as to project horizontally. 
The lengths projecting from the clamp could be varied, 
and different weights could be attached at the ends of 
the rods. Subjects were required to test the effects 
of any one of the variables on the flexibility of a rod, 


Measured by the amount of bending which occurs when weights 
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are hung on the end. 

CV-B is a variation of this test designed to 
be administered to groups of subjects and objectively 
scored. Subjects were supplied with a set of seven 
wooden planks varying according to type of wood (pine 
Sreceawood)l, slengthes(z0-cm Ol. s0.cm)),, width (2.3°>cm 
or 4.5 cm), and thickness (7 mm or 14 mm). The rods 
were labelled A to G. The redwood planks were clearly 
Oiirerentein color srfrom the: pine planks, and in addition 
were clearly labelled as redwood or pine. 

Subjects were given a written test to accompany 
the set of planks. This test is reproduced in Appendix A, 
cocethenewi thm theminstructtons fom thewtester, (pp. Ll3-125). 
With the aid of a diagram, subjects were shown a method 
OmecestingerOre stitiness. OL a plank, .by placing, the 
plank on two blocks and setting a weight on the middle. 
In a series of three written questions, subjects were 
then asked how they would test whether the stiffness was 
affected by length of plank, thickness of plank, or type 
of wood respectively, and to name the planks they would 
use. In each case, only one pair of planks would give 
a valid test properly controlling variables. 

A subject could thus select none, one, two, 
or three pairs correctly. The set of planks is so 


constructed that three are pine, four redwood; three are 
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long, four short; three are wide, four narrow; and three 
are thin, four thick. There are consequently 12 possible 
ways to pick a pair of planks which differ on one given 
variable. A subject has three chances to select a 
correct. pale (tnireerquescions)), SO™thatethew probability 
of selecting exactly one pair correctly by chance is 


— 
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There are three different sets of three responses 
SOUuta MmingmexacimlymtyomCOrreculpalts. theretore the 
; aes of selecting exactly two pairs correctly by 
chance is 
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The probability of selecting exactly three pairs 

correctly by chance is 
1 1 1 1 

Lond 2 EN DMC 288 

If the criterion for Substage III-B is set at 
two pairs chosen correctly, then. the proportion; of all 
subjects classified incorrectly as Substage III-B would 
have an expected value of only 0.02. On the other hand, 
if there is a finite probability p that a given Substage 
III-B subject might make a mistake in selecting or 


writing down a pair, the probability that such a subject 
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will be incorrectly classified as Substage III-A is 
3p with a criterion of three pairs correct, and approxi- 
mately 3p? withPasicni terion ofitworpairs correct .+ 

Thus using a criterion of three pairs correct, 
DYpem: errorsgarceunt vcelyyabutiithe probabitlity “of Type IT 
ennouSemayebeuquite. hvoghwem ror aicriterion tof two pairs 
correct, Type I errors remain virtually negligible, but 
the probability of Type II errors is reduced virtually 
DymamsactOreof BUuseli. = On ‘thes basis’ of™thi's ‘analysis, 
the criterion used for Elassweic alien into’ Substage® LLII-B 


on CV-B was set at two pairs selected correctly. 


COMBINATORIAL SYSTEM TEST 


In the combinatorial system test used by Inhelder 
and Piaget (1958, Ch. 7), subjects were presented with 
four different chemical solutions in flasks, and another 
in a dropper bottle. Subjects were shown that when some 
of the chemicals were mixed in a certain way, a yellow 
colour was formed on addition of the liquid from the 
dropper bottle. They were then required to reproduce the 
yellow colour by mixing the chemicals. Subjects who 


proceeded systematically to generate combinations of the 


lactually the probability is ae (las <p). 
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four solutions were classified as Substage III-A. The 
more systematic subjects were classified as Substage 
III-B, the difference being "only one of degree [p. 120]." 
The difficulty with using the combinatorial 
system test to determine Substage III-A lies in the 
subjectivity of the judgment as to whether a subject 
is proceeding systematically to generate the combinations 
or is randomly combining the solutions. The test used 
in the present study to be denoted CS was designed on 
the assumption that given a time limit of 15 minutes, 
only Substage III-B subjects can generate all possible 
subsets of a given set of four elements. In pilot 
administrations of CS, after 15 minutes subjects were 
found either to have finished, or to be duplicating 
combinations. When a shorter time was used (10 minutes), 
some subjects complained that they needed more time. 
Subjects were given a printed question sheet and 
answer sheet. These sheets are reproduced in Appendix A 
(ppeli3—125)ee The prob lemawhichesubvectsmnad@toucons1cer 
was the number of different ways a transistor radio 
can be bought given the possibility of up to four 
"extras" (earphone, carrying strap, leather case, or 
extra battery). Subjects were required to write or draw 
"as many different choices they could think of" on the 


answer sheet. A time limit of 15 minutes was imposed. 
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There are 15 possible ways of combining the extras,as 
well as the possibility of no extras. Subjects who 
generated “all 16 possibilities,’ and subjects who 
generated 15 not including the possibility of no extras, 
were classified as Substage III-B. Pilot research 
involving this test indicated that most subjects who 
could systematically generate all combinations seemed 
Pomecetetiadt a ieee nWemestioet =(radiOwalone jae wentewithout 


saying." 


GENERAL FEATURES OF THE TESTS 


The four tests have the following features in 
common : 

(a) Each test is designed to classify subjects 
according to one substage only, either Substage III-A 
eye Ais 

(b) The criterion for success in each case is 
objective. 

(c) The tests may be administered to groups 
of subjects. 

(d) Subjects record their own responses with 
pen or pencil on the answer sheets. 

(e) Each test takes from 12 to 20 minutes to 


administer. 
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Having each test designed to determine only 
One stage level allows the adoption of objective criteria. 
These objective criteria are as follows: 

In the volume conservation test, the subject 
had to draw lines representing predicted water levels 
in four instances. If a subject drew all four lines 
correctly he was classified as Substage III-A. A line 
was considered to have been drawn correctly if any part 
of the line was within 0.5 mm of the correct level. 

This was determined by drawing a line 1 mm thick at 
the correct level. 

In CV-A, the subject had to indicate whether 
the test described proves that rods made of brass bend 
more easily than rods made of aluminum, to say why or 
' why not, and to suggest two ways of making it a better 
experiment to test whether rods made of brass bend more 
easily than rods made of aluminum. A subject was 
classified as Substage III-A if either or both of the 
following: 

(a) he answered "no" to the first question, 

(b) he suggested in answer to the third question 
that at least one of length, diameter, or bending force be 
equalized for the two rods. 

In CV-B, the subject was asked how to test for 
the effects of length, thickness, and type of wood 


respectively on the stiffness of a plank, and to indicate 


which planks to use. A subject was classified as 
Substage III-B if in his answers he selected a correct 
pair of planks at least twice. 

In the combinatorial system test, the subject 
was required to indicate in : 15-minute time limit all 
the different ways a radio could be bought given four 
possible "extras." A subject was classified as Substage 
III-B if he generated all 16 possible ways, or 15 ways 


not including the radio alone. 


ADMINISTRATION OF THE TESTS 


Tests were administered to subjects in groups of 
seven to fourteen. The volume conservation test was 
administered verbally according to predefined protocols 
together with demonstrations. The other tests were in 
written form, but the tester read the instructions and 
questions to subjects in each case. Instructions for 
testers were prepared in advance and these are reproduced 


ineADDendi xeA (pp sal lg—125))— 


THE SAMPLES 


A stratified random sample of grade 7 students 
and a stratified random sample of grade 9 students were 
selected from the populations of grade 7 student and 


grade 9 students in three rural high schools in the 
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Annapolis Valley of Nova Scotia. The grade 7 population 
consisted of 529 potential subjects. The grade 9 
population consisted of 455 potential subjects. Fruit- 
growing, small farming, and tourism are the major 
industries in this predominantly white area. The schools 
in the region adopt a practice of streaming classes 
according to IQ. The less able students in the lower 
streams pursue "modified" or "adjusted" programs in 
smaller classes. In order for the samples to be 
representative Ofsthe popusations with respect ito 
ability, sex, and educational experience, subjects were 
' selected randomly from a number of strata. Most strata 
consisted of all subjects of the same sex in the same 
class. In order to preserve proportionality, small 
classes were combined to form single strata in two 
cases, and males and females in a small class were 
combined in one case. 

The number of students of each sex in each of the 
strata, and the number selected from each stratum are 
shown) instables) land 2... .the gquade./esamplemconsisted 
Oteovesub jects, J0smalerandeacvmecena le mee sm LepLesents 
aesamplemctompopu LatioOnmnrattOorolmU 2 meu lica.gbAdCCmoumSami le 
consisted of 54 subjects, 25 male and 29 female. This 
also represents a sample to population ratio of 0.12. 


In the random selection, more than the required number 


TABLE 1 


Composition of Strata and Sample 


Grade 7 
Population 
School Section Males Females 
Central Kings a 22 16 
2 e7 18 
3 16 16 
4 17 18 
5 17 20 
6 10 15 
7 10 4 
Hants West A 15 20 
B pak 18 
C 20 Hy! 
D 21 16 
E 13 5 
F 14 2 
Cornwallis A 14 18 
B 16 is’ 
e id 11 
D 16 10 
E ial 10 
Total 282 247 


Sample 
Males Females 
2 2 
2 2 
2 2 
2, 2 
4 2, 
2 2 
2 2 
2 2 
Z 2 
P2 2 
2 1 
2 2 
2 1 
ib iL 
2 1 
ai a 
30 27 
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TABLE 2 


Composition of Strata and Sample 


Grade 9 
Population Sample 
School Section Males Females Males Females 

Central Kings 16 16 its) 2 2 
Led 17 ga) 2 2 

18 14 14 2 2 

19 15 eS 2 2 

20 17 10 2 ik 

aa 6 ay 1 -- 

Hants West A dks) 20 2 2 
B 10 20 1 2 

Cc 17 12 2 2 

D 1k3} ily; 2 2 

E 17: 9 1 1 

Cornwallis A 15 19 2 2 
B 5 19 Z 2 

C 182 22 at 3 

D 1B iy 1 2. 

E 5 DS a m2) 

Total 0 245 2D) a 


of subjects were selected in each stratum, and the 
potential subjects were listed in order of selection. 
Subjects who were absent for one of the pretests were 
replaced by the next available member on the list for 
that stratum. No replacements were made after the 
pretests. 

Individual IQ scores were not available for 
the subjects, but a frequency distribution according 
to Lorge-Thorndike IQ was obtained for most of the 
grade 7 subjects who took the posttests. The frequency 
distribution is shown in Table 3. A chi-square goodness 
of fit test indicates that the frequency distribution 
LoenOemscTOnit Cant yedlifterent) from ythe normal distri-. 


bution (X* = 4.91, df = 5, p > 0.30). 


INSTRUCTIONAL SEQUENCES 


Two instructional sequences were used: The 
combinatorial sequence was designed to teach a systematic 
method of generating the set of all subsets of a set of 
four elements. The controlling variables sequence was 
designed to teach (a) the need to control variables, and 
(b) the skill of selecting test pairs from given arrays. 
Full-details of instructions for the teachers of the 


sequences are given in Appendix B (pp. 126-137). 


47 


48 


TABLE 3 


Frequency Distribution of IQ of Grade 7 Sample 
and Goodness of Fit to Normal Curve 


IQ Range Observed Expected Chi-Square 


0- 80 4 4.8 3 
81 - 90 6 8.5 74 
91 - 100 15 18h 2 .25 
101 - 1lo 9 13.2 lense 
111 - 120 12 8.5 1.44 
121 - a. OS Loi 

Total 53 53 4.91 
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The combinatorial sequence 

The combinatorial sequence consisted of three 
20-minute sessions. In the first session, each subject 
was presented with 20 paper baking cups and an envelope 
containing 40 cardboard triangles of varying size and 
shave, alo@olue ,.Uered,mLOn yellow, and LO black; Sides 
of the triangles varied from about 2 cm to 4 cm. Sub- 
jects were given the task of placing one, two, three, or 
four- triangles in the cups in as many different ways 
aS possible, so that no two cups had the same contents, 
and no cup contained more than one triangle of any one 
scolour. In pilot testing the use of cups for the 
combinations was found to reduce subjects' tendency 
to consider permutations rather than combinations. 
-As the subjects attempted the task, the instructor 
supervised subjects' activity, clarifying the rules, 
and pointing out incorrect subsets or duplicates. The 
‘objective of this session was to make sure that subjects 
‘understood the problem in preparation for being given 
a systematic method of solving it. 

“In the second session, the instructor explained 
a systematic way to generate the complete set of com- 
binations by ones, twos, threes, and fours. Subjects 
were then allowed to practise the systematic method, 


first with the cardboard triangles previously used, 
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then with kits of four different kinds of objects-- 

10 each of paper clips, paper fasteners, "bingo chips," 
and pennies. Instead of baking cups, pink sheets 

ruled into 20 squares were used. Subjects were allowed 
to keep the pennies. The systematic method consisted 
Oofsplacing, the cbjects sin a row, then taking successive 
Dalttceas (Ol lowsess firsts and second, first and third, 
iceman OmLOlGchiemoecCcOondmandmthird, «second andl tourth, 
third and fourth. Sets of three were then generated 
DysoOmtUCtiIng One Or eatNesrOur In turn, and) finally ail 
four were taken together. 

In the third session, subjects were given 
printed sheets giving practice in generating combinations 
using pencil and paper rather than manipulating real 
objects. These printed sheets are reproduced in 
Ae penol wba DO. 20-t57) =n Chestinst part ot the 
practice session, subjects were given a choice of three 
problems. One required the generation of all possible 
POuCcEitsugiven foursditferenteitems of clothing. 
Another required the generation of all possible combina- 
tions of ketchup, relish, cheese, or onions on a 
hamburger. The other required the generation of all 
possible combinations of whitewall tires, automatic 
transmission, radio, or car heater in ordering a new 


automobile. 


SL 


In the second part of the practice session 
Subjects were required to draw "all the different ways 
you can have a square" if the square can have any of 
aneinscribeducircle,masdaot anesthe centre, .a.border, or 
a cross of diagonals. At no time throughout the 
combinatorial sequence was any mention made of the 


transistor radio problem. 


Controlling variables sequence 

Like the combinatorial sequence, the controlling 
variables sequence consisted of three 20-minute sessions. 
The first session was designed to teach subjects the need 
POmCONLEOUVallablec.mwalne instructor demonstrated! two 
experiments where control of variables was necessary 
but not carried out, and then used the results to draw 
false unlikely conclusions. 

In the first demonstration, two glasses were 
shown to subjects, one 9 cm high and 8 cm in diameter 
at the top, the other 11 cm high and 6 cm in diameter 
at the top. A labelled container of sugar, a teaspoon, 
and a tablespoon were placed on the table beside the 
the glass. The instructor demonstrated that "sugar 
dissolves quicker in a short AGES thane Ngamtakieg Lass ju 
by pouring water into the two glasses to different levels 


and stirring a large amount of sugar from the tablespoon 
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in the tall glass and a small amount of sugar from the 
teaspoon in the short glass. A discussion ensued as 
subjects objected that it was not a proper proof, and 
that both equal volumes of water and equal amounts of 
Sugar would be necessary to "do it right." 

The second demonstration involved "proving" 
that red coloured rubber is "stretchier" than blue 
coloured rubber by hanging a heavy weight on a long 
thick red rubber band and a light weight on a short thin 
blue rubber band. A discussion ensued with the conclusion 
that equal weights, and rubber bands of the same length 
and thickness, would be needed to perform a proper test. 

In the second session of the controlling variables 
sequence subjects were shown a photographic light meter. 
Its use was demonstrated using a light bulb. An array 
of light bulbs of different shapes and wattages, some 
frosted and some clear, were shown to subjects. The 
abvay ole lignt bulbs is 1)tustratedwin Fig. 1.8) scub ects 
were required to pick out pairs of bulbs to be used to 
test whether colour (frosted or clear), wattage, or 
shape affect the light meter reading. The remainder 
of the second session and the third session were designed 
to give subjects practice in selecting test pairs from 
given arrays. In session two, each subject was given a 


packet containing eight strips of paper; one blue and 
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Array of light bulbs used in controlling variables sequence 
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one pink strip of each of the following four sizes: 
IS cley (ole te FAG sy Kort 
Joins) Wel bi, base (ali 
32-0 CM exec oe Cr 
SO mCi xe ee Om C ils 
Subjects were asked to select pairs "the same 
colour, the same length, but different width" and so on. 
The session concluded with similar practice using (a) 
rubber tubes of different colour, length, and diameter; 
(b) metal rods of different material, length, and diameter. 
In the third session of the controlling variables 
sequence subjects were given practice in selecting pairs 
differing in only one aspect from pictures of arrays of 
objects varying in a number of different aspects. The 
pictures and printed questions are reproduced in 
Appendix B (pp. 126-137). In this case, subjects worked 
with pen on paper rather than with real objects. ahpouene 
out the controlling variables sequence no mention was 
made of the problem of the brass and aluminum bars nor 


the problem of the wooden planks (CV-A and CV-B). 


PROCEDURE 


Tests and instruction sessions were administered 
to subjects in their respective schools so that any one 
test or instructional sequence was given to all subjects 


on the same day but at different times. The schedule 
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schedule in Table 4 was followed for the grade 7 sample. 
In each session the subjects were divided into two equal 
Or nearly equal groups. Two experimenters were always 
present. Each group spent 20 minutes with each experi- 
menter to make a total of 40 minutes for each session. 
Each experimenter administered the same test or the same 
instructional session to all subjects in the one day. 

Five different experimenters (coded A to E in 
the schedule in Table 4) took part at various times 
according to availability. All experimenters were male. 
All were experienced in teaching secondary school 
students and in administering tests. 

The four tests were administered to the grade 9 
sample on April 10, 1975. One experimenter administered 
the volume conservation test and the CV-B test, the other 
experimenter administered the CV-A test and the combina- 


torial system test. 


SUMMARY 


Samples of 57 grade 7 students and 54 grade 9 
students were selected from the populations of grade 7 
and grade 9 students in three rural high schools in Nova 
Scotia antesteon Volumes cOonservallols twOucests of 
controlling variables, and a test of generating combina- 


tions were administered to both samples. Subjects were 


Session 
Number 


i 


TABLE 4 
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Testing and Instruction Schedule for Grade 7 


Date 


February 18 


February 24 


February 27 


February 28 


March 5 


March 7 


jsyopensal 9 


TesStworeinste nice1on 


volume conservation 
CV-A 


CV-B | 
combinatorial system 


NO-ESe, BCONLLOI Ing 


variables sequence 


Now eeonbunarorilal 


NOL ye COnGLlo bling 


variables sequence 


NO BCOMDInaromia. 


Noms eCOncLrolLing 


variables sequence 


NOeeeo pmCOmbInatorvia 


Posttest CV-A and CV-B 
Posttest Combinatorial system 


Retention test CV-A and CV-B 


Experimenters 
A Pretest 
B Pretest 
A. Pretest 
C Pretest 
A Session 
C Session 
sequence 
A Session 
Session 
sequence 
Session 
&E Session 
sequence 
D 
E 
A 
D 


Retention test Combinatorial 
system 


Dh 


classified as. Substage III-A or Substage III-B on the 
basis of these tests. Each test was designed to reduce 
the possibility of systematic Type I error and objective 
criteria were used to score the tests. The grade 7 
sample was then subjected to a total of two hours of 
instruction in controlling variables and generating 
combinations over a one-week period, and retested for 
Substage III-B immediately after and one month after 


the instructional sequence. 


CHAPTER TLV. 
RESULTS 


Complete results are shown in Table 5 for the 
grade 7 sample, and in Table 6 for the grade 9 sample. 
Puem@prOpOrtions of ssubjects who met the criteria 
established in Chapter III for success on individual 
tasks and for classification into Substages III-A and 
III-B will now be considered. Subsequent sections 
of this chapter will be concerned with changes in 
subjects' performance on aie following the instruc- 
tional sequences and the retention period, and with 
correlations among the tasks. The chapter concludes 


with a discussion of the results. 


PERFORMANCE ON TESTS 


Percentages of subjects who were successful on 
each task are shown in Table 7. Percentages of subjects 
who met Substage III-A and Substage III-B criteria are 
shown in Table 8. Before instruction, 51% of the grade 7 
subjects and 80% of the grade 9 subjects passed either 
Or both sofa VCaand CV-A and were classified as Substage III-A. 


Before instruction, 47% of the grade 7 subjects and 50% 
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TABLE 6 


1 


Results for Grade 9 Sample 


SIE vc CVE CV=5 CS 


Subject 
Number 


10 


ital 


ee 


iE) 


14 


15 


16 


iby) 


18 


19 


20 


a 


TARE Ou COn td ) 


Subject Sex VC CV-A CV—B CS 

Number 
a2 M i : 2 2 Lee 
PLS M 2 2 1 a 

24 F 2 2 Le 2 
25 F 1 Z 2 il 
26 M 1 2 Lu 1 
ah M 2 2 2 2 
28 F 2 2 1 2 
29 F Z 2 2 2 
30 M - 1 A at 
31 M 1 2 L 1 
32 F 1 1 af 2 
53) E a a 1 2 
34 M 2 2 2 Z 
35 M 2 2 2 2 
36 F Z ae i il 
a M it 2 1 1 
38 M 2 2 2 2 
39 M 2 2 Ad 2 
40 F 2 v7 2 2 
41 M 2 2 1 1 
42 M i 2 A 2 
43 M af iz 1 i. 


TABLE 6 (cont'd) 


Subject Sex vc CV-A CV-B cs 


Number 
44 F 2. 2 Z 1 
45 F 2 i Z 2 
46 M 2 2 1 ub 
47 F 2 1 1 1 
48 F 1 2 1 1 
49 F 1 1 AL 2 
50 M 2 ih i aL 
Sik F 1 1 1 1 
2 F 1 ib 1 1 
a ve) F 2 1 ti i 
54 F AL il u 1 
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TABLE 7 


Per Cent of Subjects Successful on Tasks 


Grade 7 Grade 9 
Task Pretest Posttest Retention N=54 
N=57 N=50 N=47 
Volume 
Conserva- 39 57 
tion ' 
CV-A 32 48 64 65 
CV-B Bye 44 51 Sul 
cs 16 48 45 39 
TABLE 8 


Frequency of Subjects Classified as Substage III-A 
and Substage ITI-B 


Met Substage III-A Metwsoubstagesa Ii. 
Test Criteria Criteria N 


Frequency Per Cent PCECGUCNCYV ere Ge Cel tc 


Pretest 

Grade 7 29 51 20 47 57 
Posttest 

Grade 7 34 68 50 
Retention 

Grade 7 314 68 47 


Grade 9 43 80 Paty 50 54 
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of the grade 9 subjects passed either or both of CV-B 
and CS and were classified as Substage III-B. (Substages 
III-A and III-B are not exclusive categories here.) 

Comparisons between grade 7 pretest performance 
and grade 9 performance can readily be made using 
Tables 7 and 8. A directional chi-square test (Siegel, 
1956, pp. 104-111) is convenient to determine whether 
the proportions meeting criteria in the grade 9 sample 
are Significantly higher than in the grade 7 sample. 
The grade 9 performance is significantly better than 

2 


(hesgrade 7 pretest performance on VC (xy = 4.36, df = 1, 


025), on CV-A (x2 = 11.0, df = 1, p < .0005), and on 


Ke) 
A 


CS (x? One) CO lye ces) ) ee elaucke 1S en@ssigniticant 
difference between the grade 7 pretest preformance and 


the grade 9 performance on CV-B EL < or 


, af =1, 
p > .10). | 

The proportion of grade 9 subjects meeting the 
criteria for Substage III-A is significantly higher than 
the proportion of grade 7 subjects meeting the criteria 
for Substage III-A (x* = 8.92, af = 1, p < .005). On the 
other hand, the proportion meeting the criteria for 
Substage III-B is not significantly higher for the grade 9 
sample than for the grade 7 sample on the pretest 
(ee TOU C tame epee lO) Le eMoOcemtitangnalreotethe 


grade 7 sample and four-fifths of the grade 9 sample met 


shaxp 2¢ 
ae Tied 


68 


the Substage III-A criteria. About half of each of the 
‘grade 9 sample and the grade 7 sample on the pretest 


met the Substage III-B criteria. 


INSTRUCTION VAND RETENTION EFFECTS 


For the grade 7 sample, performance on the pretest 
was compared with performance on the posttest using the 
McNemar test for significance of changes (Siegel, 1956, 
DD.035-0/)Pa@eeroOr Gach of CV—-A, CV-B, CS, and the Sub- 
Stage J1I-Bi classification, the following null hypothesis 
was tested: 

H? for those subjects who change from pretest 
LO posttest, the probability (P.) that any subject will 
‘change from pass to fail is equal to the probability (Ps) 
Pictmtnemsioyect will change from tail to pass. =| That is, 


Pree=ePpey= 1/2. H 


> A 
a a E ve 


1? sed 

A directional test was used since subjects were 
predicted to perform better on the posttest than on the 
pretest. The significance level was set at five per cent. 
At this level for a one-tailed test the critical value of 
chissovdreswichponceccolccmO1m eed OmMmioesa/) mngs Ltd — 
ficant chi-square indicates that performance on the 
posttest was better than performance on the pretest. 


Table 9 shows changes in performance on CV-A, 


CV-B, and CS between the pretestand posttest. The number 


69 


TABLE 9 
Changes Between Pretest and Posttest + 
P 2 
osttest x 
Fail Pass (McNemar Test) 
CV-A (N Sten) 
Pass il 165 
Pretest 
Fail 22 7/ WARS 
CV-B SUN a= 50) 
Pass 4 12 
Pretest 
Fail 18 16 6.05* 
CS (NS 30) 
Pass 3 6 
Pretest 
Fail 23 18 935% 
Substage III-B Classification (N = 50) 
Pass 4 21 
Pretest 
Fail 2 13 oi (ha 
* 
9) <3 als) 
A 


In this and subsequent tables, only subjects present 


for both administrations are shown. 
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of subjects present for both pretest and posttest was 
45 for CV-A and 50 for CV-B and CS. Performance improved 
‘between pretest and posttest for both CV-B ne =~ 6:05-, 
df = 1, p < .01) and cs (y* = 9.35, df = 1, p < .005). 
mhere-swas no Significant improvement in CV-A (x7 =i) duet esyy 
oe; De eee Ot ct moe SOmSCNOWS Clianges Ill subjects. 
classification according to-Substage III-B criteria 
between the pretest and posttest. The number of subjects 
Classiri1ed, aseoubStagesLl1-B increased from 25 to 34. 
This improvement is significant (x7 sai Sheil: (be oo aky. 
jo SS Silas 

Table 10 shows changes in performance between 
pacstrest and retention test. Changes are not significant 
for CV-B (x? Oye ee lye el) nOL eLOTECS (x? = O13, 
Otel at DceecL 0). ae De spropontionwofesubjectsaclassified 
as Substage III-B did not change significantly between 
posttest and retention test ee = Mths IE Yahe Se Wh fe ee IMO) 
For CV-A, however, more subjects passed the retention 
test than passed the posttest. The difference is signi- 
ficant using a non-directional test (x? = 4.00, df = 1, 
Pia 0>)s. 

A more direct test was used to determine whether 
lasting significant gains were achieved in Stage III 
“performance by comparing pretest performance with 
retention test performance. The results are shown in 


Table 11. Significant improvement between pretest and 
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TABLE 10 


Changes Between Posttest and Retention Test 


Retention Test Chi-Square 
Fail Pass (McNemar Test) 


Pass 3h 18 
Posttest 
Fail 13 8 Res SIC! 
eapsiey (Ue olay 
Pass 6 16 
Posttest 
Fail ks! 5 0.32 
cs (N = 40) 
Pass 3 15 
Posttest 
Fail aly) 5 0.70 
Substage III-B Classification (N = 40) 
Pass 5} 25 
Posttest 


Fail 9 s Whig EE 


2 


TABLE 11 


Changes Between Pretest and Retention Test 


Retention Test Chi~-Square 


Fail Pass (McNemar Test) 


CV-A (N = 43) 


Pass Z 12 
Pretest - 
Fail Ve 17 10.3 
CV-B (N ="47) 
Pass 2 14 
Pretest 
Fail 20 11 4.9 * 
eS 1S ayy 
Pass i 4 
Pretest 
Fail 25 ee 7 25 * 
Substage III-B Classification en S17) 
Pass 2 19 
Pretest 
Fail es 13 6.66* 


*p ¢.05 
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retention test was shown in CV-A (x? = LO e305 Le il, 


p< .005),.in CV-B (y° = 4,5 , df = 1, p < .05), and in 
cS (x? = 12.5, af = 1, p < .0005). The proportion of 
subjects who met the Substage III-B criteria increased 
significantly from 45% to 68% (x7 = 6.66, df = 1, p < .005). 
To summarize, Significant gains occurred in per- 
formance on CV-B and CS between pretest and posttest. 
Significant gains occurred in performance on CV-A, 
CV-B, and CS between pretest and retention test. The 
proportion of subjects classified as Substage III-B 
increased significantly from 50% to 68% between pretest 
and posttest, and from 45% to 68% between pretest and 
retention test (percentages calculated of subjects who 
were present for both tests). 
One further aspect of the changes in behavior was 
considered: that of a predicted interaction between age 
and instruction with Substage III-B as criterion. According 
to the idea of intellectual structural limitations, those 
Subjects most likely to accelerate to Substage III-B in 
the period between pretest and posttest would be the 
older subjects. These subjects would be more likely to 
be closer to "natural" development of Substage III-B 
behavior. A two-way analysis of variance with age and 
instruction as independent variables and performance on 


tasks as criterion was applied. Performance on tasks 


was determined by allotting one point to a subject for 


- 
. 


“y ‘ ae -_ e b 
oti iS 2) us Saal 
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success on each of CV-A, CV-B, and CS, making a possible 
three points. This design has repeated measures on 

the instruction factor. Subjects were all chosen from 

one grade level in the study, and since students of 

lower achievement often repeat years in the educational 
system which supplied the sample, some precaution was 
necessary to counteract the possibility that any inter- 
saction effect would be masked by older subjects of lower 
ability. The precautions were first, the removal from 

the sample of subjects over the -age of 169 months (older 
subjects Hee ears entered school late or else repeated 

a grade); and second, the removal from the sample of 
students in the lower ability streams in each school 

(those streams pursuing modified or adjusted programs). 
After the exclusion of such subjects, 35 remained who 

were present for both pretest and posttest. Age was made 

a fixed factor with two levels: those born before and 
those born after January 17, 1962. The subject of 

median age (born January 17, 1962) was excluded, equalizing 
the cell frequencies. There was no significant interaction 


between age and instruction (F = 2.00, df = 1.32, p > .10). 


CORRELATIONS AMONG TESTS 


‘Tables 12 and 13 are matrices of phi-coefficients 


of correlations among the tests for the two samples. 


a “ 
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Phi-coefficients Among Tests (Grade 7 Sample) 


TABLE 12 
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ee SE 
Retention Test 


vc 

Pretest 

vc 

CV-A .31* 

CV-B .18 

CS ale 
Posttest 

CV-A 15 

CV—EB es. 10 

CS 04 
Retention 
Test 

CV-A. .23 

CV-B .08 

cs 6 ihe) 


Pretest 
CV-A CV-B Cs 


mo Ls 


ASR? 


22 


Oe 
24 


05 


42% 
25m 


-30* 


18 -O1 
Aoi Re «22 

meal) 

= 20 

- 20 -18 
26 - 10 
-05 09 
oil -.06 
-40* 04 
oL7 20 


POStLest 
CV-A CV-B 
ake) - 10 
Ol eee 24 
-20 2.0 
18 3 he) 

14 
14 
A eee 
Ses ene) 
2 Oe 
A De eA A 


CS CV-A 
04 223 
~05 ~42* 
05 eel 
OST OS 
Aceon 
eee o ede) 
14 
14 
APRS ~34* 
oO) re OO 


CV—=5 


08 
oe On 


-40* 


-34* 


CS 
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TABLE 13 


Phi-coefficients Among Tests (Grade 9 Sample) 


vc CV-A CV-B CS 
vc 24 -42* 0s 
CV-A 24 moo od NS) 
CV—=B ~42* 4, ebay OL 
CS Ue pals) 4 Ss 


p< -05 
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For N = 50, a Sicoer a ecent Ofs0 s2 cause soni ficantly 
different from zero (Ween Ol) gee De themwremaindar of this 
section, attention will be confined to those particular 
cases where positive correlations are to be expected 
for certain reasons. 

A measure of the reliability of the individual 
tasks and of the Substage III-B criteria can be gained 
by considering the correlations between posttest and 
retention test results on the tasks and on the Substage 
Petetec lacs ilcat One LiGepiiscoctticients fom CV—-A, 
Sy beandeConarecmreescpectively 059 7,430.54 ,.and 0.61. 

For Substage III-B classification the phi-coefficient 
is 0.64. The corresponding contingency tables appear 
tiie nab lesa (p..69)) 

For the classification as Substage III-B, 

62.5% met the criteria on both posttest and retention 
test, 22.5% met the criteria on neither posttest nor 
retention test, and 15% met the criteria on one test and 
not the other. In other words, the classification pro- 
cedure was consistent for 85% of the subjects. 

Formal-operational structures are supposed to form 
an integrated system (Inhelder & Piaget, 1958, Ch. 17), 
therefore the four formal operational tests would be 
expected to correlate with each other. In particular, 
vc and CV-A should correlate with each other. The 


phi-coefficients between VC and CV-A are 0.33 on the 
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PLetes Ge Om thesgqrademmcampile, and) 0736.f0r the 
Grade, Jesamp lesgepothparcemsagnitacantw( pik 05). 

CV -AgandeCy—Sesioulasnot on | Vyacor:elate, but also 
be related to each other in hierarchical fashion. Fewer 
subjects would be expected to fail CV-A and pass CV-B 
iat viCesVvelLsa. ue laoles! 4eshOwsethe relationship between 
CV-A and CV-B performances by the grade 9 sample and 
on the pretest, posttest, and retention tests by the 
grade 7 sample. Phi-coefficients are significant 
ExCepcpin coc posttest torsqrade:/. To determine 
if the predicted hierarchical relationship between 
CV-A and CV-B holds, a test similar to the McNemar test 
used previously is appropriate. The null hypothesis 
in this case is: 

Ho: For those subjects who pass only one 
test, the probability that any subject will pass CV-A 
rand» fail CV—-B (that is P.) is equal to the probability 
that any subject will fail CV-A and: pass CV-B (that is 


Pp That is eee Deal: ore 


d 
1: Pa > ae 


Again a directional test is used since the 


a: 
H 


hierarchical order is predicted. The chi-square values 
aLemnOtes) onic cant slLOoustnesGradew epreLese (x? = .071, 
fee ei 0), PDOSLLeSt (x7 eee me eel Dae 0) % 


Or-retention test ee ee ee bee ae O Jier ane 


TABLE 14 


Relationship Between CV-A and CV-B 


is) 


CV-A 


CV-A 


CV) 


CV-B 


Fail Pass 
Grade 7 Pretest (N = Bi) 


Pass 7 10 


Fic ee 28 7 
Crd om mLOst res Cam UN Go 0) 


Pass 9 5 


Bail 1h iS) 


Grade 7eRetention; Test. (N 


Pass 2 20 


Fail val 4 
Grades oniNe 4} 


Pass 20 eS 


Fail a Eg) 2 


aval 


wo 9% 


ads 


47) 


35° Fs 


oes) 
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chi-square value is significant for the grade 9 test 
(Xm L282 re Cis eee 1010.05.) ¢ 

Significant correlations would also be expected 
between performance on CV-B and CS, the two Substage 
III-B tasks. Table 15 shows the relationship between 
CV-B and CS on the pretest, posttest, and retention 
EeStoelOr Gnade, andetor grade 9, Two of the phi- 
Cosine tcl entem@alessi dnl! Canta p sm .05) 3. (those for the 
Geadem, posttest (0.33) eandstor.the gqrades9ytest (0.36) ; 

An hierarchical relationship is expected between 
Substage III-A classification and Substage III-B classi- 
fication. Table 16 shows the relationship between the 
Substage III-A and III-B classifications for grade 7 
(pretest) and grade 9. The respective phi-coefficients 
SromiOtesi Gi teiccdiis( gee. 00) Lhe predicted hterarchical 
relationship is borne out in the grade 9 results 
(x? mle Oped ie Dee 005) mou LenOt in  themg Lado, 
results (x? =) Bry tebe Ce ak, feh e aitley is UNelel geraleisionnirinelales Cene 
subjects who meet the Substage III-B criteria but not the 
Substage III-A criteria are 16% of the grade 7 sample, 
and 6% of the grade 9 sample. 

To summarize, the results of the correlational 
analysis indicate that the tests and classification 
criteria are fairly reliable. Correlations predicted 
to be significant are generally so, except that the 


posttest grade 7 performance on CV-A does not correlate 
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TABLE 15 


Relationship Between CV-B and CS 


Pass 
CV-B 

Fail 

Pass 
CV-B 

Fail 

Pass 
CV-B 

Fail 

Pass 
CV-B 

Fail 


CS 


Fail Pass 


Grade 7 Pretest (N = 57) 


18 a 

30 | 8 
Grademye OStCest ss (Ne— 

1B 18 

is 6 


Grade 7 Retention Test 


Le iS) 


14 8 


Grade 9 (N = 54) 


6 Hd 


Zul 10 


50) 


(N 


81 
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mw 
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47) 
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TABLE 16 


Relationship Between Substage III-A and 
Substage III-B Classifications 


Substage III-B 
Fail Pass Phi 


Grade 7 (N = 52) 


Pass vil 10g) 
Substage 
III-A 
Fail 16 8 » PAS 
Grade 9 (N = 54) 
Pass 19 24 
Substage 
ei A 


Fail 8 3 rae 
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Significantly with the posttest grade 7 performance on 
CV-B, and that the grade 7 performance on CV-B does not 
correlate significantly with the grade 7 performance on 
CS in the pretest or retention test. 

Two predictions of hierarchical relationship are 
not borne out in the grade 7 sample: CV-A was not pre- 
requisite to CV-B in the pretest or the posttest. The 
Substage III-A classification criteria were not prere- 
quisite to the Substage III-B criteria in the grade 7 
sample. On the other hand, hierarchical relationships 
were borne out quite well in the grade 9 sample. Possible 
explanations of these particular anomalies will be dis- 
eeses below, but all of the anomalies seem to be 
connected with the pretest and posttest grade 7 performance 


on CV-A. 


DISCUSSION 


The implications of the results of the study for 
the idea of intellectual structural limitations are 
discussed in the remainder of this chapter, together with 
some of the possible implications of the correlational 
analysis results for the tests ChHemee tes and for the theory 
of formal operations. The chapter concludes with a note 


COnetnergeneralizdt1on Of the findings tO Others populations. 
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A case has been made that application of the idea 
of intellectual structural limitations to secondary school 
rests on the validity of three statements: 

(a) The onset of the stage of formal operations 
(Substage III-A) does not occur for a majority of students 
until the secondary scnoolsyears or later. 

(b) The full realization of formal operations 
(Substage III-B) is not attained for a majority of students 
during the secondary school years. 

(c) Progression from Substage III-A to Substage 
III-B is necessarily slow (of the order of years) and 
is not subject to appreciable acceleration in the 
secondary school years. 

For structural limitations to be relevant to 
secondary grades, either statement (a) has to be true, 
or else both statements (b) and (c) have to be erue. 
Structural limitations are not relevant to secondary grades 
if statement (a) is false and either of statements (b) 
or (c) is false. 

First consider statement (a). In the survey, 

51% of the grade 7 sample and 80% of the grade 9 sample 
were successful on either VC or CV-A and were classified 
as Substage III-A. Thus statement (a) is false for the 
grade 7 and grade 9 samples. For the grade 9 population 


the statement is false, while for the grade 7 population, 
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the truth of statement (a) is marginal. Standard errors 

of the percentages are about 6% for grade 7, 5% for 

grade 9. A 90% confidence interval around the grade 9 
percentage does not contain 50% (80%18%). A 90% confidence 
interval around the grade 7 percentage contains 50% (51%+t10%). 
According to the pretest, at least 40% of the grade 7 popula- 
tion had entered the period of formal operations while 

at most, 30% of the grade 9 population had not entered 

the period of formal operations. These last two facts 

(made at the 90% level of confidence), together with the 

fact that on the retention test 64% (S.E. = 5%) of the 

grade 7 sample was successful on CV-A, can be taken to 
indicate that statement (a). is very: doubtful. 

Statement (b) is that a majority of secondary 
Sstudentsuaresnotmain SubstageriII-BiemtThe truth of 
statement (b) is marginal Fok both the grade 7 and grade 9 
populations. | Fifty per cent of the grade'9 sample and 
47% of the grade 7 sample were classified as Substage 
III-B. The standard error of these percentages is about 
6%. 

Assuming that statement (a) is false and allowing 
thatuistatement Mb)imis deubtiul, but possubly true, the 
applicabiilitvdot the tdeafoiminteblectual@structural 
limitations to secondary students rests on the truth of 
statement (c): that appreciable acceleration to Substage 


III-B is not possible. In the training experiment, 
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Substage III-B behavior was fairly easily induced in 
a significant proportion of the grade 7 sample. A 
majority of the grade 7 students (68%) was classified 
as Substage III-B both on the posttest and the retention 
test. This represents a significant increase, from 
below 50% on the pretest, following two hours of instruc- 
tion. Of the subjects who took both pretest and retention 
test, half of those who were classified as being below 
Substage III-B on the pretest attained Substage III-B 
classification on the retention test. About one-quarter 
of the grade 7 sample did not display Substage III-B 
behavior on any of the test administrations. A reasonable 
assumption is that alternate and perhaps longer sequences 
of instruction might be successful in inducing Substage 
III-B behavior in a larger proportion of grade 7 subjects, 
or that the same sequence of instruction might be more 
successful with grade 9 students. Therefore a reasonable 
conclusion is that if the idea of intellectual structural 
Pimitationspiseintrinsicallya#validjmitecanwapplystoeno 
more than about one-quarter of the grade 7 population, 
and probably to lesser fractions of corresponding 
populations in higher grades. 

Consider now the results of the correlational 
analysis. As a general rule, these results are consistent 
with the structural integrity of formal operations 


tasks. Significant positive correlations were found 
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among most tests, although none of the phi coefficients 


are of any large magnitude. Two anomalies are: 

(a) the absence of an hierarchical relationship 
between CV-A and CV-B, which seemed to carry over into 
a lack of hierarchical relationship between the Substage 
III-A and Substage III-B classifications for the grade 7 
sample; and 

(b) the low correlation between CV-B and CS, 
the two Substage III-B tasks. 

For the grade 7 sample, the CV-A test was 
anomalous throughout. Although the posttest results 
on CV-A were not Significantly different from the 
pretest results, the retention test results were 
Srcnmentcantlyvebecter than sbOtn.. In the posttest, 26% 
of the subjects failed CV-A but passed CV-B. One 
interpretation is that the validity of either or both 
tests is questionable, since CV-A is supposed to Aerie 
if subjects see the need to control variables, while 
CV-B is supposed to determine if subjects are able to 
control variables. Another interpretation is that there 
may be no necessary difference between seeing the need 
to control variables and being able to do so. This 
interpretation is not borne out, however, by the fact 
that a strong hierarchical relationship shows up in the 


grade 9 sample. Possibly the questions asked in CV-A 


Je, s 
: ew a4 Seca 7 P Oe 


- Ae ot . 
4? Br oo [oe Fe3g 2: 
- a - 


88 


are too vague and ambiguous, and the level of verbal 
comprehension demanded is too high for grade 7 subjects. 
The requirement of verbal responses in this test probably 
increases the probability of Type II error (Brainerd, 
1974). If so, the proportion of Substage III-A subjects 
in the grade 7 sample was probably underestimated. 

' The low correlation between CV-B and CS raises 
theoretical problems. If formal operations are structured 
in an integrated system, then the results of all formal 
operational tasks should correlate. For the grade 7 
posttest and the grade 9 test the correlations are 0.33 
SonomUGt One OURLICMULACCmeDLeLesteandmrelentron, test, 
the correlations are both 0.21. The maximum possible 
values of phi corresponding to the marginal totals is 
the two cases where significant correlations occur are 
0.82 and 0.85, so that the phi-coefficients obtained can 
be considered to indicate moderate correlation. In the 
case of the grade 7 pretest, the difficulty of CS probably 
explains the low coefficient: only nine subjects passed 
CS and the maximum value of phi for the corresponding 
marginal totals is only 0.38. The low correlation in 
the grade 7 retention test remains anomalous. If the 
four phi-coefficients corresponding to the four tests 


were averaged, the mean value would be 0.38. Bearing 
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in mind that the phi-coefficient between posttest and 
retention test administrations of CV-B and CS were 0.54 
and 0.61 respectively, a moderate correlation could be 
considered to exist between CV-A and CS. 

Bredderman .(1973) found the results Ofetestseol 
controlling variables and generating combinations to have 
qeeeal sone. Or 04 Jeeebredderman Ss results include 
Sejorelation withementalwage, and 1f£ this as) partialled 
Sule Bem coll tSeOlelheetwoetests shave a partialeyn of 0.31. 
in a correlational study, Bart (1971) found Pearson 
coefficients among four Piagetian tests (GLT-3, -8, -1l, 
met Os range trom.0.455tO10./5.) lhe theory of an 
integrated structure for formal operations suggests that 
high correlation coefficients among formal operational tasks 
should be the rule. The matter warrants further investi- 
gation. 

Finally, with reference to generalization to other 
populations, there is no reason to believe that the 
populations studied contained unusually large proportions 
of especially bright students, or students who had been 
exposed to especially rich) learning) expervences Therefore 
the findings of this study can fairly safely be generalized 
to other populations except where unusual cultural 
deprivation is found, or where the ability distribution 


is negatively skewed. The finding that the idea of 


90 


intellectual structural limitations does not apply to 
the grade 7 population may be extended safely to higher 
grades. There is no reason to suspect that the idea 

of intellectual structural limitations has any more 
applicability tovnormalsurban secondary: school popula- 


tions than it has to the population studied. 


CHAPTER V 
SUMMARY, CONCLUSIONS, AND RECOMMENDATIONS 


The study represents an empirical test of the 
applicability of the idea of intellectual structural 
limitations to secondary school students, that is that 
certain kinds of learning are not possible for students 
in secondary school grades because they lack formal- 
operational structures and cannot develop them readily. 

The research was in two parts, a survey and an 
experiment. In the survey, samples of grade 7 and grade 9 
students from three rural high schools were tested using 
a volume conservation task, two controlling variables 
tasks, and a combinatorial system task. In the experiment, 
the grade 7 sample was then administered two one-hour 
instructional sequences spaced over a one-week period. 

One sequence consisted of instruction in controlling 
variables, and the other consisted of instruction in 
generating combinations. At the end of the instructional 
sequence and again one month later, the controlling 
variables and combinatorial system tasks were readministered. 

On the basis of performance on the tests, and 
in accordance with theoretically derived ree 513% 


of the grade 7 sample and 80% of the grade 9 sample were 
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classified as being at least in Substage III-A of the 
period of formal ppeie tench while 47% of the grade 7 
sample and 50% of the grade 9 sample were classified 

as being in Substage III-B of the period of formal 
Operations. After the instructional sequences, 683% 

of the grade 7 sample was classified as Substage III-B 
on both the posttest and retention test. The increases 
in the proportion of Substage III-B subjects from the 
pretest to the posttest and from the pretest to the 
retention test are significant at the five per cent level. 
These results were interpreted to indicate that the idea 
of intellectual structural limitations does not apply to 
secondary school grades. 

A correlational analysis of the results indicated 
that the tests are fairly reliable, and that as a general 
rule, they intercorrelate according to predictions of 
the theory of formal operations. Exceptions were (a) 
that the two controlling variables tests failed to display 
consistently a predicted hierarchical relationship, and 
(b) that one of the controlling variables tasks and the 
combinatorial system task, both designed to classify 
subjects as Substage III-B, did not correlate with each 


other above a moderate level. 
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CONCLUSIONS 


The results indicate that if the idea of 
intellectual structural limitations applies to secondary 
school grades at all, it can apply to no more than. about 
25% of grade 7 students, and to lower DLOPOLELONS EOL 
students in - higher grades. Of 34 grade 7 subjects who 
had not displayed Substage III-B behavior on the 
controlling variables task in the pretest, 16 displayed 
SuctmDenavLGOUeabtereagtOtai Of only One hour seinstruction 
in controlling variables. Of 41 grade 7 subjects who 
had not displayed Substage III-B behavior on the com- 
binatorial system task in the pretest, 18 displayed such 
behavior after a total of only one hour's instruction 
in generating combinations. A reasonable assumption 
would be that more instruction would be successful with 
more of the subjects. 

If the idea of intellectual structural limitations 
were to be applied in the selection of learning experiences 
for the grade 7 sample studied, then in their schooling no 
attempt would be made to have the abilities to control 
variables or generate combinations used, applied, or 
practised by those subjects whose pretest behavior 
indicates that they do possess these abilities, and no 
attempt would be made to develop these abilities in those 
subjects whose pretest and posttest behavior indicate that 


they can readily acquire the abilities. 
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There are two possible interpretations of the 
results alternative to the interpretation given above. 
The first alternative is that the gains between pretest 
and retention test are a function of the testing procedure 
and independent of the development of any abilities. The 
second alternative is that no structural changes were 
induced in the instructional sequence but that subjects 
learned behavior specific to the tests. These alternative 
interpretations will now be discussed. 

In the first alternative, the hypothesis is that 
in the experiment, subjects learned nothing more than 
how to do the tests. According to this interpretation, 
some subjects may have understood the need to control 
variables, or been able to control variables, or been 
able to generate combinations, but yet failed the pretest 
because they did not understand what was required in the 
tests, or because they were anxious in the test situation, 
or because they were distracted by the strangeness of the 
unfamiliar requirements. As these subjects became more 
familiar with the tests on successive administrations, 
they performed up to their abilities and went from fail 
to pass. This interpretation would mean that both 
statements (a) and (b) are false: that a majority of 
subjects in secondary school has attained both Substage 


IZTI-A and Substage III-B as indicated by the results 


95 


Ofetherorades /eretentionstest.. The interpretation further 
implies that a single administration of a formal-operational 
task to a group of subjects would always result in under- 
estimation of the proportion of formal-operational subjects. 
With the second alternative, the hypothesis is 
that the Substage III-B performance on the posttest and 
retention test is specific learned behavior and not the 
result of structural change in subjects. This would mean 
that in the same theory, the same behavior allows of two 
contradictory interpretations. In one case (the pretest) 
the behavior (B) infers a structure (S), while in another 
case (the posttest), B infers not S but specific acquired 
habits. This would mean that except in an entirely 
novel situation, S is not a necessary condition for B. 
To make matters worse, S can never be a sufficient 
condition for B, because a subject cannot be successful 
on a task (by emitting B) unless he understands what he 
is being asked to do, is attending, and is willing to 
cooperate. If S is neither a necessary nor a sufficient 
condition for a given B, then structure must be a very 
mysterious entity andthe results of Piagetian tasks must be 
very hazardous bases for important decisions about the 
kinds of learning experiences to which students should be 
exposed. 
The fact is that at the end of the experiment in 


the present study, two out of three of the grade 7 subjects 
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could control variables in testing the stiffness of 
planks, or generate combinations of accessories to a 
radio, or both. A total of seven classroom periods-- 

less than a day's school--was taken up in instruction and 
testing to reach this point. To say that such abilities 
(being formal-operational) cannot be learned by secondary 
school students (because they are concrete-operational) is 


false. 


RECOMMENDATIONS FOR FURTHER RESEARCH 


The relationship between CV-A and CV-B warrants 
Pigcicreinvestigatvon: = The question 1s whether there is 
any necessary time-consuming developmental process 
between recognition of the need to control variables 
and being able to control variables in a given situation. 
In order to answer this question, tests for the recognition 
of the need to control variables would have to be developed 
which are less verbal. In order to control for the effect 
of differing content, CV-A and CV-B items could be paired 
on the basis of similar content. For example, a Substage 
III-A controlling variables task involving planks of wood 
could be paired with a Substage III-B task involving 
planks of wood. The reliability of each measure could 
nts ae by having a number of items, but this would 


lead to a problem in keeping the length of the tests down. 
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The forms of the questions would have to be abbreviated 
from those used in the present study. 

Similarly the relationship between CV-B and CS 
Menitcsmcurtnereattention wand controlling, for the effect 
of content would seem to be necessary in this case also. 
If content is controlled for, then the theory of formal 
Operations would predict that the two measures ought to 
correlate to a level at least comparable with their 
reliabilities. 

Close attention could well be paid to that 
minority. of subjects who continued to fail to meet the 

Substage III-A and Substage III-B criteria after the 
instructional sequences. These subjects could be 
screened out using the procedures of the present study. 
An appropriate initial approach to these subjects might 
consist of clinical interviews or longtitudinal case 
studies. 

A careful attempt to detect the predicted 
interaction between age and instruction, which did not 
show up in the experiment, could well be made. Piaget's 
notion of "natural" development definitely implies that 
sucheansinteraccron shoulcsoccur melt, masmthes LesuLts 
of the present study suggest, development in the period 
of formal operations can be influenced by specific learning 


experiences, then an interaction between educational 


environment and instruction might be more likely. 
Students from richer educational environments might be 
less likely to show gains as the result of instruction 
than students from poorer environments. The design of 
an experiment oriented to this problem might be quite 
difficult, but it would represent a clear test between 
predictions from alternate hypotheses. 

Finally, the results of the present study should 
NOL Le Laken LO aindicaces=that most secondary students 
are "fully formal operational" in the sense of being 
aoe mCOmSOLVemCOnecc Ly st neonOlOblemseg1 ven ine lie 
GLOWwLn OfeLogical Thinking. “The™tacts that the grade 9 
sample performed better on the tasks than the grade 8 
sample, and that significant training effects were found, 
indicate that students' thinking ability is developing 
in the secondary school years. Therefore the final 
recommendation is for a study of the effects on subjects 
of varying age and ability levels of long-term instruction 
in formal-opertional schemes and abilities, such as 
“proportionality, the combinatorial system, controlling 
variables, and) fOrmale Ogi Con oUuCheamsLUdyemi git ewe 
result in’the finding that-contrary to*’the idea of 
intellectual structural ee senate the more exposure 


to and practice in formal thinking, the better. 
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APPENDIX A 


TESTS 


Volume Conservation 


The response sheet for the volume conservation 


task is reproduced as Figure 2. The instructions for 


the tester are as follows: 


ine 


Say "Write your name, sex, and birthday at the top 
of the answer sheet. After each question, don't go 
back and change anything." 

Place three glasses on a table in front of you so 
that all subjects can see. Fill each glass to the 
Mark. Draw subjects' attention to the fact that 
the level is the same in each glass. 

Show subjects the six plasticene balls which are 
the same size and point out that they are the same 
Size. 

Say "What do you think will happen if I place this 
ball in this [middle] glass? I want to know if it 
will take up any room in the water. Where will the 
water come to? Lets try it." 

Place the ball into the middle glass and say 

"See? The water came up to here." 

Say "Now if I put this ball [fone the same as the 


first] into this [left-hand] glass, I want to know 


3 
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how much room it will take up in the water. Where 
will the water come to? You can show me by putting 
a line on the third glass in the answer sheet. Put 
it beside number one. Draw a line on the third 
glass to show where you think the water will come to. 
I'll show you. If you think the water comes to here 
[indicating a level on the left-hand glass], draw a 
line like this [demonstrate on an answer sheet or a 
blackboard]. If you think the water only comes to 
here put whealanes  ikewthises Dositanow.n @Pause, 
then ask " Is-apybody not sureswhateto!do?fss If any 
subject is not sure, repeat the explanations once. 
Say "Let's try it." Place the ball in the glass 

ONS YCUGELELe ae caysnocecea Cheswaters comess ito here-- 
the pats as in this [middle glass]. If you weren't 
correct, please don't go back and change your answer." 
Remove the ball from the glass on your left. 

Take the small plasticene ball. Say "Now let's 

take this small ball. I want to know how much 
LOOnsLeewiuletakecmupes rn Sie Wfcketssen lhe JR jeitke yeep icy 
ball in the water, where will the water come to? 

Mark it beside number two on the answer sheet. 

Do it now. Number two." 

Wadtr  ChenesaVia ONe LetuSmcr Volt see acesthe ball 

in the glass on your left and say "See? The water 


came up to here." 
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10. Repeat 8 and 9 three times substituting first, 
the large plasticene ball (answer sheet number 
three); second, the steel ball the same size 
as the six standard balls (answer sheet number 
four); and third, the small lead ball (answer sheet 
number five). | 

ll. Say "Now let's do something else. We start with 
these two the same size. [Indicate two of the 
Standard balls.) SWeetake this one? and Troll £t 
OQUGHIntOsa Sausages elie Leaput the» ball=ain: the 
water, the level comes up to here [indicating the 
level in the middle glass]. What if I put the 
Sausage into the water? I.want to know how much 
room the sausage will take up in the water. Where 
will the water come to? Mark it beside number six." 
Do not demonstrate immersion this time. 

12. Repeat step 11 twice, first breaking a ball up 
into five approximately equal pieces; and second 


squashing a ball into a "pancake. 


The question and answer sheet for CV-A is 
reproduced on page 121. The instructions for the tester 
were as follows: 

1. Give out the answer sheets. Caution subjects not 


to say anything, just to write. Have them write 
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their names on the sheets. 

Hold up the brass and aluminum rods, one in each 
hand, so that the rods are vertical, about 50 cm 
apart, »thesbrasserodgon syour sleft.. «Say, "These: are 
the rods you are supposed to look at." Allow five 
seconds, then hold them horizontally, Side-by-side, 
about 3 cm apart, the aluminum rod on top aligned 
at neither end. Allow five seconds, then place 
the rods about 3 cm apart on a desk or table so 
that they stand vertically with the brass rod on 
VOUr right. 

Read the test to the subjects saying "Question one 
Say Sisal sarees). 

Allow 10 seconds for subjects to write answers 
to question one and two minutes for question two. 
Stop the test 20 minutes after you begin. 


Note: if subjects and questions related to clarifying 


thestest,asey. .JOmwnal you think issright. 


CV>B 


The question and answer sheets for CV-B are 


reproduced on pages 112-113. The instructions for the 


tester were as follows: 


is 


are 


Give out the kits of wood and the tests. 
Have any new subjects write their name, sex, and 
birthday at the top of the answer sheets. Other 


subjects write only their name. 
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Have subjects line up their planks on their desks 


in alphabetical order with the left-hand edges 


aligned. Say "You can move the planks around all 
you want when the test starts, but they should 

be like that to begin with. 

Say "See that some are pine and some are redwood. 
Have the markings up. Now when we say ‘'thickness' 


we mean this way [indicate] from top to bottom. 


That's thickness. This way [indicate] is width. This 


WayelincL cartel io Lengtna geruteyour finger son a 


WIldG,. tiLeck plank. “= Check that? cach subject 1s 
DOU COM amCOMLeCC EMD Lan se Odyaa ULeyOUL Linger 
Onwagnareow we taiten Olank. = Cleck again. — &xplain 
the difference again to any subjects whose responses 
indicate confusion between length and thickness. 
Saye Now tet 1s reed the teste |= Slowly read through 
the entire test. 

Say "Now it's very important that you say which 
Diane) OUsWOUlds Se =cCOs COmUllemles So mmm aio tie 
planks by their letters, for instance you can Say 
'olankeAnw,;.Or planks wredndescOsOn. 

Tell subjects to begin. Allow 20 minutes for the 


entire administration. 
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The question and answer sheets for CS are 


Zeproduced sas Figuresmomand 7... lhe instructions for 


the tester were as follows: 


1. Say 


test. 


PYOup ie havemcOrminuces to vtry this little 


Looks at i2£,- and let's read it together." 


2. Read it slowly with added explanation as follows: 


(a) 


(b) 
(c) 
(da) 
(ce) 
(f) 
(g) 


(h) 


after 


Gad OmmeLniwlinecwlesaymitkerthatelittie 


picture. 


say "for" before the price each time. 


after 
after 
after 
after 


after 


"$10" say "that's those crisscross lines." 
Eo 2mescaymml natecniingsat therside.” 

"$1" pause five seconds. 

"draw" say "whichever. you like." 


"$45" say "Let me show you another one. 


Yous couldshavyestie strap and the battery 


[draw on chalkboard] for $40 plus $5 plus $2. 


WWinysme abe” gy 


after 


"Sheets" ask "Is anybody not. sure what 


hor tlote 


3. Answer any questions related to what to do, how to 


write answers, etc. Record such questions. Do not 


answer questions related to possible number of 


Combinations mod VmelewanltevOlstOmElgure:iteout, . 


=— = 4 


Sade expe ton 
7 Sty 


peat, 


OGe wont tece ImvOUetiatiwmernat Sapart or the test, 


4. Allow 15 minutes for subjects to respond. 
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TEST NUMBER TWO 2a 


Look at the two metal bars being held up by the tester. 
The silvery one is made of ALUMINUM. 

The other one is made of BRASS. 

A student wanted to find out if rods made of brass bend 
more easily than rods made of aluminum. The student tried 
to bend each rod. The pictures show what happened. 


BRASS Rop 


QUESTIONS: 


1. Does this prove that rods made of brass bend more easily 
than rods made of aluminum? 


Ze aWhy or why snot? 


Bum Give twOeways you Could make this a, better test, to see 
if rods made of brass bend more easily than rods made 
of aluminum. 


(1) 


(2) 


ie 
TEST NUMBER FOUR 


How to test the stiffness of a wooden plank:~— 


A wooden plank can be tested for stiffness by placing 


PomOnetwo, blocks like this, 


The less it bends, the stiffer it is. 


You have been given a number of planks, some wide, some 
narrow, some thick, some thin, some short, some long, some 


made of pine, some made of redwood. 


TEST FOUR CONTINUED 23 


1. Suppose you want to find out if the length of a plank 
makes any difference to its stiffness. How would you 
find out? (say which planks you would use) 


Bemeouppose you want to: find out @i-the=thickness of a plank 
makes any difference to its stiffness. How would you 
do a test to find out? (say which planks you would use) 


3. Suppose you wanted to test whether the kind of wood 
affects the stiffness of a plank (pine or redwood). How 
would you find out? (say which planks you would use) 
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TEST NUMBER THREE 


A transistor radio COstS) y40;, Dut you Can) buy any. 


of the following "extras" if you want to: 


leather case ers S10. 
— 


951. 


OR extra battery = S San 
OR ear phone Sle 


Suppose you want to buy a radio. Write down or draw as 


OR carrying strap 


Many different choices as you can think of, and the prices. 


For example 


radio + case + earphone Soul 


radio + strap $45. 


Put your answers on the special answer sheets 
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APPENDIX B 


INSTRUCTIONAL SEQUENCES 


Controlling Variables Lesson One 


Sugar experiment. Say "I'm going to prove 

that sugar dissolves more quickly in a short glass than 
ineagcalie glass, OK? = Watch. Fall sthe tall) glass about 
two-thirds full with water, the short glass almost full. 
Place a tablespoon of sugar in the tall glass and stir 
for about eight seconds so that not all dissolves. Then 
place a teaspoon of sugar in the short glass and stir 
rapidly until all dissolves. Say “that proves sugar 
dissolves more quickly in a short glass right?" Discuss 
the experiment with subjects asking how to make it a 
"good test." Establish that equal volumes of water, and 


equal amounts of sugar are required. 


Rubber experiment... ody e. Ll havessome™ rubbers bands 
here" and produce a number of red and blue rubber bands 
Varying according to) thickness, of rubber swildcheOoLerubber, 
and length. Say "someone told me red rubber is 'stretchier 
thanvblue. rubber.» Letecetryel taolte mete IepDUtCTaaWelone 
[large wrench] on this red one [large, thick] and see how 
far it stretches. Now put a weight [small pliers] on this 
blue one [small, thin] and see how far it stretches." Hold 


the two weighted rubber bands side by side, then put down 
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the weights and hold up the rubber bands side by side. 
Say "So that proves it--red rubber is stretchier than 
Dives rubberyeright: " Sltesubsects°obyect’ that’ the bands 
are not the same size, repeat using bands the same length 
but different width. Continue, secrete with subjects, 
until the requirements of equal weights, and equal 


dimensions= tore the rubber pands are’ established 


Controlling Variables Lesson Two 

Proiueoulbs am laCcescnmae tables between =1nsStructor 
and subjects the lamp base and the array of light bulbs 
(as shown in Figure 1 [p. 53 J). Mount lamp B (24W clear 
picture bulb) in the base and switch on, then off. 
Demonstrate the photographic light meter. Indicate 
where the light enters and show where the needle registers. 
Switch on the picture light and show the needle reading. 
Switch off to show how the needle returns to zero. Say 
"This is a 25 watt light bulb. How can we peal a(sll Coxbuey She 
lignis bulbs with higher wattsemake themneedile go upsmore: 
Through discussion, establish which pairs of bulbs provide 
a test of whether the wattage of the bulb affects the 
meter reading. Repeat the procedure to establish proper 
tests of whether meter reading is affected by shape of 


bitlb, core Lrosting. 


— 


; c _) rd. eee 
[deseo oleae hd, dp 
en ae rene Ph tien ee 
es) 


+» sorte 326 22 
> aetisorla 20 2% 


128 


Papeiotbipsssmodyerlet Sutryesonceningedifterent. ” 
Give each student a package of paper strips. Say "Put 
these on your desk. There are two colours, two lengths, 
two widths. Pick out a pair the same colour and width, 
but with different lengths. [Check, help individuals 
where necessary.] Find another pair. AnOcnemeas Another, & 
Repeat this procedure, Fieee fOvE pal Boma LELeming only 
Pimwicdtht tien, LOra pales dittering only an colour. | Do 


not mention controlling variables or experimental tests. 


Rubber tubes. Show subjects the array of rubber 
Cubes: diatiiering in length, ‘diameter, and *colour /(red™and 
black). Ask individual subjects to select pairs of 
tubes "the same except for length" then "colour," then 


menicknessh.. 


Metal rods. Repeat the rubber tubes procedure 
using the array of steel and copper rods of varying 
length and diameter. 

WaskSmiOre lOUuay. 

The third lesson in the controlling variables 
sequence was, together with the third lesson in the 
combinatorial sequence, formed the classroom assignment 
entitled "Tasks for Today," which is reproduced on 


Pegesio2 tole ls. Questions aa? mend so slormed) Lesson 
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three--the combinatorial sequence, while question 4 

and 5 formed lesson three in the controlling variables 
Sequences Instructors awere atold: tochandijiout the wsheets 
to subjects and ask them to begin at question l. 
Instructors were told to help subjects as requested, 
and to direct subjects to begin question 4 after 20 


minutes irrespective of progress on questions 1,.2, and 3. 


Combinatorial Sequence Lesson One 
Objective.) ssubjectsewill understand the 
triangles problem: given a packet containing 40 card- 
board itriangles—-i10; bine, 10 black, 10 red, 10 vellow, 
place none, one, two, three, or four triangles in baking 
cups according to the following rules: 
(a) no two baking cups have similar contents 
(b) only one triangle of any one colour in any one 
baking cup. 
Stated behaviorally: places triangles in baking cups 


GOMreECL. Vraccording stowaboves ries: 


Equipment )® ‘For seachusubject: 
(a) one white envelope containing 10 red, 10 blue, 
10.:yellow, L0*bback. cardboard triangles; 


(b) set of 20 baking cups clipped with paper slide. 


Method. als) soay es lem) qgoingatosget you) to do a 


fittlestask. You might find it very easy, you might not.” 
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2. Give out the paper triangles, and say "There should 
be 40 triangles in your envelope. Empty them out on the 
desk. See that there are ten of each colour." Wait 
until everyone is sure of having the correct number of 
triangles. 

Beemcay)  6OK Ainow el Imegoing eto sive sen Oe baking cups 
and then I'll explain the rules of the task." Caution 
students that cups are light and blow around easily, 

and tell them to put the cups out in rows on the desk. 

Ao ay "Here are the rules. You have to put none, one, 
two pethiec ;cor efour strianglessineasecup-, For.instance, 
aeredmandeasbluc, OGejuUStpasred; mon a Sneek, e yellow 
wend a blue. [Demonstrate.] You can't have more than 
one triangle of the same colour in any cup. [Demonstrate.] 
No two cups can have the same sets of triangles in. You 
have to make as many different sets as you can. 
[Demonstrate.] Any questions? Go ahead and make as many 
diiterentescts taSyyouscansas eihescolourspohethe cups have 
nothing to do with the experiment. 

5. Point out mistakes and explain as class works. 

6. Get students to clip cups together and put triangles 
back in envelope. 


Note: assist wherever necessary--this is 


instruction, not testing. 
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Combinatorial Sequence Lesson Two 
1. Say "Today I will give you a way to do the task 
Woe etheecolouredme ta ang ast. 

2. . Give subjects each a packet of triangles and a pink 
sheet ruled into 20 squares. 

3. With the aid of four large triangles, show subjects 
how to generate the combinations by taking one at 
a time, two at a time, three at a time, and four 
at a time. 

Ao Let subjects #ftrysitvwoncecwithetherpaperetriangles, 
assisting individuals where necessary. Each combina- 
tion is to be placed in a square on the pink sheet. 

5. Take back the packets of triangles and give subjects 
packets containing 10 plastic counters ("bingo chips"), 
10 paper slides, 10 papersfastenérs, and 10 pennies. 
Ask them to do the same thing with these objects 
instead of the triangles. 

6. Have students return the packets of objects, but 


allow them to keep the pennies. 


TASKS FOR TODAY 


1s 


oe 


Make aS many combinations as possible with the set of 

4 kinds of objects in the packet supplied. Use the 
pink sheet ruled in squares to put the combinations on. 
Ask for assistance if you need it. 


When you have finished number 1, pick one of the following 


and do it:-- 

A. CLOTHES. 

B. HAMBURGERS. 
GaeaCARS. 


Suppose you have bought: 

(1) a new belt 

(2) a new hat 

(3) a new sweater 

(4) a new pair of sunglasses 

Figure out all the ways you could wear 
these. For example you could wear the 
belt and hat, or just the sweater or all 
four. Use the pink sheets to write down 
the combinations (or draw them). 


With a hamburger you can get: 

(1) ketchup 

(2) relish 

(3) cheese 

(4) onions 

Figure out all the different ways you could 
have the hamburger. Use the pink sheet 

to put down your answer. 


When you buy a new car you can get, as 
extras, any Of these: 

(1) whitewall tires 

(2) automatic transmission 

(3) "radio 

(4) heater 

Figure out the different ways you can buy 
the car. For example you could get the 
whitewalls and the radio, or just automatic 
transmission, or all four. Use the pink 
sheet. 
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3. A square ( ) can have a circle in it 


a 
Ge 


a cross.in.it 


ae DoOrdersonw ia 


or a dot in the 
middle 


Work out all the different ways you can have a square. 
For example: 


and so on, 
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4. 


Look at the sheet with pictures of screws, buttons, and 


cups on it. Try these: 


A. 


The screws are different lengths and thickness, 
and have different shaped heads. 


Pick out two that are the same except 
for length: and 


Pick out two that are the same except 


_for shape of the head: and 


Pick out two that are the same except 
for thickness: and 


Another two: and 


The buttons are different in the following ways, 
some have rims, some don't. Some have two holes, 
some four. Some are large, some small. 


Pick out two that are the same except 
for different rims: and 


Pick out two that are the same except 
for size: and 


Pick out two that are the same except 
for the number of holes: and 


The glasses are different as follows: some have 
a wide base, some a narrow base. Some are tall, 
some are short. Some are "Straight-sided" while 
others are "tapered." 


Pick out two glasses different only in 
height: and 


Pick out two different only in shape and 


Pick out two different only in size of 
the base: and 
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Buttons 


2 Wee 


Glasses 
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The belts problem 
The belts differ as follows: 
in length 
in width 
in shape at the end (pointed or straight) 
in shape of the buckle (square 
or round SS) ) 
I. Pick out two belts the same except for 
width: and 
iiwellcK Out two the same except for type 
of buckle: and 
III. Pick out two the same except for shape 
at the end: and 


IV. Pick out two the same except for 
length: ia and 
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